Skip to content

DBpedia Spotlight is a tool for automatically annotating mentions of DBpedia resources in text. Improving Efficiency and Accuracy in Multilingual Entity Extraction approach

License

Notifications You must be signed in to change notification settings

gharyong/dbpedia-spotlight-model

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

35 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DBpedia Spotlight Build Status

Links

website - http://www.dbpedia-spotlight.org

status service - http://status.dbpedia-spotlight.org

download service - http://download.dbpedia-spotlight.org

demo service - http://demo.dbpedia-spotlight.org

CI -http://jenkins.dbpedia-spotlight.org

General Notes

Since v1.0, DBpedia Spotlight was split into two versions, under the same API, as follow:

This important movement was the way that we found to deliver faster fixes and new releases, providing solutions for each annotation approach.

Our first achievement is related with licensing. DBpedia Spotlight Model is now full compliance with Apache 2.0. It means that you can use it without any commercial restrictions.

We are so excited because there's even more great news to come.

If you require any further information, feel free to contact us via [email protected]. We are already very excited to spend time with you on further community meetings and to publish new DBpedia releases.

Keep annotating,

All the best

Shedding Light on the Web of Documents

DBpedia Spotlight looks for ~3.5M things of unknown or ~320 known types in text and tries to link them to their global unique identifiers in DBpedia.

Demonstration

Go to our Demonstration page, copy+paste some text and play with the parameters to see how it works.

Endpoints

http://api.dbpedia-spotlight.org/{LANGUAGE}/annotate

Call our web service

You can use our demonstration Web Service directly from your application.

curl http://api.dbpedia-spotlight.org/en/annotate  \
  --data-urlencode "text=President Obama called Wednesday on Congress to extend a tax break
  for students included in last year's economic stimulus package, arguing
  that the policy provides more generous assistance." \
  --data "confidence=0.35"

or for JSON:

curl http://api.dbpedia-spotlight.org/en/annotate  \
  --data-urlencode "text=President Obama called Wednesday on Congress to extend a tax break
  for students included in last year's economic stimulus package, arguing
  that the policy provides more generous assistance." \
  --data "confidence=0.35" \
  -H "Accept: application/json"

Run your own server

If you need service reliability and lower response times, you can run DBpedia Spotlight in your own In-House Server. Just download a model and Spotlight from here to get started.

wget http://downloads.dbpedia-spotlight.org/spotlight/dbpedia-spotlight-1.0.0.jar
wget http://downloads.dbpedia-spotlight.org/2016-16/en/model/en.tar.gz
tar xzf en.tar.gz
java -jar dbpedia-spotlight-1.0.jar en http://localhost:2222/rest

Models and data

Models and raw data for most languages are available here.

Citation

If you use DBpedia Spotlight in your research, please cite the following paper:

@inproceedings{isem2013daiber,
  title = {Improving Efficiency and Accuracy in Multilingual Entity Extraction},
  author = {Joachim Daiber and Max Jakob and Chris Hokamp and Pablo N. Mendes},
  year = {2013},
  booktitle = {Proceedings of the 9th International Conference on Semantic Systems (I-Semantics)}
}

Licenses

All the original code produced for DBpedia Spotlight Model is licensed under Apache License, 2.0.

Documentation

More documentation is available from the DBpedia Spotlight wiki.

FAQ

Check the FAQ here

Maintainers

About

DBpedia Spotlight is a tool for automatically annotating mentions of DBpedia resources in text. Improving Efficiency and Accuracy in Multilingual Entity Extraction approach

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Scala 56.1%
  • Java 42.1%
  • Python 1.2%
  • Other 0.6%