Abstract
If you regard Wikipedia as a main place where the knowledge of mankind is concentrated, then DBpedia, which is extracted from Wikipedia, is the best place to find the machine-readable representation of that knowledge. DBpedia constitutes a major part of the semantic data on the web. Its sheer size and wide coverage enable you to use it in many kinds of mashups: it contains biographical, geographical, and bibliographical data, as well as discographies, movie metadata, technical specifications, links to social media profiles, and much more. Just like Wikipedia, DBpedia is a truly cross-language effort; for example, it provides descriptions and other information in various languages. In this chapter we introduce its structure, contents, and connections to outside resources. We describe how the structured information in DBpedia is gathered, what you can expect from it, and what its characteristics and limitations are. We analyze how other mashups exploit DBpedia and present best practices for its usage. In particular, we describe how Sztakipedia, an intelligent writing aid based on DBpedia, can help Wikipedia contributors improve the quality and integrity of articles. DBpedia offers a myriad of ways of accessing the information it contains, ranging from SPARQL to bulk download. We compare the pros and cons of these methods. We conclude that DBpedia is an unavoidable resource for applications dealing with commonly known entities such as notable persons and places, and for others looking for a rich hub connecting other semantic resources.
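To make the SPARQL access route mentioned above more concrete, here is a minimal sketch that builds (but does not send) a query request for the abstract of one resource in a chosen language. It assumes the public endpoint at https://dbpedia.org/sparql, the `dbo:abstract` property, and the `format` request parameter; the helper names `build_query` and `build_request_url` and the example resource `Budapest` are illustrative only and do not come from the chapter.

```python
from urllib.parse import urlencode, urlsplit, parse_qs

# Assumption: the public DBpedia SPARQL endpoint. It is not contacted here.
ENDPOINT = "https://dbpedia.org/sparql"


def build_query(resource: str, lang: str = "en") -> str:
    """Return a SPARQL query selecting the abstract of one DBpedia resource."""
    return (
        "PREFIX dbo: <http://dbpedia.org/ontology/>\n"
        "SELECT ?abstract WHERE {\n"
        f"  <http://dbpedia.org/resource/{resource}> dbo:abstract ?abstract .\n"
        f'  FILTER (lang(?abstract) = "{lang}")\n'
        "}"
    )


def build_request_url(resource: str, lang: str = "en") -> str:
    """Encode the query as a GET URL that asks for JSON results."""
    params = {
        "query": build_query(resource, lang),
        "format": "application/sparql-results+json",
    }
    return f"{ENDPOINT}?{urlencode(params)}"


# Hypothetical example resource; any DBpedia resource name would do.
url = build_request_url("Budapest", lang="en")
print(url)

# Decode the URL again to show what the endpoint would receive.
decoded = parse_qs(urlsplit(url).query)
print(decoded["query"][0])
```

The resulting URL can be fetched with any HTTP client. Changing the `lang` argument selects a different language version of the abstract, which illustrates the cross-language coverage described above. For large-scale processing, the bulk downloads are likely the better fit than repeated endpoint queries.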
Notes
- 1. Since 2007, see http://stats.wikimedia.org/EN/#see_also.
- 6. As of August 2012. For more details, see: http://wiki.dbpedia.org/Dataset.
- 8. Metaweb Technologies, Inc. It was acquired by Google in 2010.
- 9. Especially the notability test: https://en.wikipedia.org/wiki/Wikipedia:Notability.
- 16. For a screencast see: http://www.yovisto.com/labs/vissw2011/.
- 17. Deutsche Nationalbibliothek (DNB).
- 18. Personennamendatei (PND).
- 27. If you want to do your own research, use Google Scholar and search for “DBpedia: a nucleus for a web of open data” and “DBpedia - a crystallization point for the Web of Data”. Together, the two articles have received a remarkable 1,300 citations to date.
- 31. More precisely, it has a free version besides the enterprise plan.
- 33. See the literature on information extraction.
- 36. In the early stage of the Sztakipedia project our team also developed a TinyMCE-based editor, but it has since been discontinued.
- 37. tf-idf is a widely used statistical relevance measure. For details, see [23] (Salton and Buckley 1988).
- 39. UIMA stands for Unstructured Information Management Architecture. It is a modular framework for annotating content. For more details, see http://uima.apache.org/.
- 41. These are the old and new machine interface standards supported by most library systems.
References
Allemang D, Hendler JA (2008) Semantic web for the working ontologist: effective modeling in RDF and OWL. Morgan Kaufmann, San Mateo
Amin MS, Jamil H (2010) An efficient web-based wrapper and annotator for tabular data. Int J Softw Eng Knowl Eng 20(2):215
Auer S, Bizer C, Kobilarov G, Lehmann J, Ives Z (2007) DBpedia: a nucleus for a web of open data. In: 6th international semantic web conference, Busan, Korea. Springer, Berlin, pp 11–15
Baader F (2003) The description logic handbook: theory, implementation, and applications. Cambridge University Press, Cambridge
Becker C, Bizer C (2008) DBpedia mobile: a location-enabled linked data browser. In: Linked data on the web (LDOW)
Bizer C, Lehmann J, Kobilarov G, Auer S, Becker C, Cyganiak R, Hellmann S (2009) DBpedia—a crystallization point for the web of data. Web Semant Sci Serv Agents World Wide Web 7(3):154–165. doi:10.1016/j.websem.2009.07.002
Bollacker K, Evans C, Paritosh P, Sturge T, Taylor J (2008) Freebase: a collaboratively created graph database for structuring human knowledge. In: Proceedings of the 2008 ACM SIGMOD international conference on management of data, SIGMOD’08. ACM, New York, pp 1247–1250
Chan B, Talbot J, Wu L, Sakunkoo N, Cammarano M, Hanrahan P (2009) Vispedia: on-demand data integration for interactive visualization and exploration. In: Proceedings of the 35th SIGMOD international conference on management of data. ACM, New York, pp 1139–1142
Cyganiak R, Jentzsch A. Linking open data cloud diagram. http://lod-cloud.net/
DiFranzo D, Graves A, Erickson JS, Ding L, Michaelis J, Lebo T, Patton E, Williams GT, Li X, Zheng JG et al. (2011) The web is my back-end: creating mashups with linked open government data. In: Linking government data, pp 205–219
Ferrucci D, Lally A (2004) UIMA: an architectural approach to unstructured information processing in the corporate research environment. Nat Lang Eng 10(3–4):327–348
Giles J (2005) Internet encyclopaedias go head to head. Nature 438(7070):900–901
Heath T, Motta E (2008) Revyu: linking reviews and ratings into the web of data. Web Semant Sci Serv Agents World Wide Web 6(4):266–273
Héder M (2011) Integrating artificial intelligence solutions into interfaces of online knowledge production. ICIC Express Lett 5(12):4395–4401
Héder M, Farkas M, Oláh T, Solt I (2011) Sztakipedia: mashing up natural language processing, recommender systems and search engines to support Wiki article editing. In: AI mashup challenge 2011 at extended semantic web conference (ESWC 2011), Iraklion, Crete, Greece
Hogan A, Harth A, Umbrich J, Kinsella S, Polleres A, Decker S (2011) Searching and browsing linked data with SWSE: the semantic web search engine. Web Semant Sci Serv Agents World Wide Web 9(4):365–401. doi:10.1016/j.websem.2011.06.004
Jain P, Hitzler P, Yeh PZ, Verma K, Sheth AP (2010) Linked data is merely more data. In: Linked data meets artificial intelligence, pp 82–86
Kobilarov G, Scott T, Raimond Y, Oliver S, Sizemore C, Smethurst M, Bizer C, Lee R (2009) Media meets semantic web—how the BBC uses DBpedia and linked data to make connections. In: The semantic web: research and applications, pp 723–737
Korica-Pehserl P, Latif A (2011) Meshing semantic web and Web 2.0 technologies to construct profiles: case study of Academia Europea members. In: Networked digital technologies, pp 334–344
McGuinness DL, Fikes R, Hendler J, Stein LA (2002) DAML+OIL: an ontology language for the semantic web. IEEE Intell Syst 17(5):72–80
Mendes PN, Jakob M, García-Silva A, Bizer C (2011) DBpedia spotlight: shedding light on the web of documents. In: Proceedings of the 7th international conference on semantic systems (I-semantics), pp 1–8. doi:10.1145/2063518.2063519
Passant A (2010) dbrec: music recommendations using DBpedia. In: Proceedings of the 9th international semantic web conference on the semantic web, ISWC’10, vol II. Springer, Berlin, pp 209–224
Salton G, Buckley C (1988) Term-weighting approaches in automatic text retrieval. Inf Process Manag 24(5):513–523
Waitelonis J, Osterhoff J, Sack H (2011) More than the sum of its parts: Contentus—a semantic multimodal search user interface. In: Proceedings of workshop on visual interfaces to the social and semantic web (VISSW), co-located with ACM IUI, vol 13
Wood A, Struthers K (2010) Pathology education, Wikipedia and the Net generation. Med Teacher 32(7):618
Acknowledgements
We are deeply indebted to Domonkos Tikk and Pablo Mendes for their valuable comments on the text. The authors were partially supported by the grant TÁMOP-4.2.2.B-10/1-2010-0009 of the Hungarian National Development Agency (NFÜ).
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Héder, M., Solt, I. (2013). DBpedia Mashups. In: Endres-Niggemeyer, B. (eds) Semantic Mashups. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-36403-7_4
DOI: https://doi.org/10.1007/978-3-642-36403-7_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-36402-0
Online ISBN: 978-3-642-36403-7
eBook Packages: Computer Science (R0)