Skip to main content

DBpedia Mashups

  • Chapter
Semantic Mashups

Abstract

If you see Wikipedia as a main place where the knowledge of mankind is concentrated, then DBpedia—which is extracted from Wikipedia—is the best place to find the machine representation of that knowledge. DBpedia constitutes a major part of the semantic data on the web. Its sheer size and wide coverage enables you to use it in many kind of mashups: it contains biographical, geographical, bibliographical data; as well as discographies, movie metadata, technical specifications, and links to social media profiles and much more. Just like Wikipedia, DBpedia is a truly cross-language effort, e.g., it provides descriptions and other information in various languages. In this chapter we introduce its structure, contents, and its connections to outside resources. We describe how the structured information in DBpedia is gathered, what you can expect from it and what are its characteristics and limitations. We analyze how other mashups exploit DBpedia and present best practices of its usage. In particular, we describe how Sztakipedia—an intelligent writing aid based on DBpedia—can help Wikipedia contributors to improve the quality and integrity of articles. DBpedia offers a myriad of ways to accessing the information it contains, ranging from SPARQL to bulk download. We compare the pros and cons of these methods. We conclude that DBpedia is an unavoidable resource for applications dealing with commonly known entities like notable persons, places; and for others looking for a rich hub connecting other semantic resources.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 59.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 54.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Since 2007, see http://stats.wikimedia.org/EN/#see_also.

  2. 2.

    http://dbpedia.org.

  3. 3.

    http://en.wikipedia.org/wiki/Wikipedia:Notability.

  4. 4.

    http://owl.man.ac.uk/factplusplus/.

  5. 5.

    http://www.w3.org/TR/owl-features/.

  6. 6.

    As of August 2012. For more details, see: http://wiki.dbpedia.org/Dataset.

  7. 7.

    http://www.freebase.com/.

  8. 8.

    Metaweb Technologies, Inc. It has been acquired by Google in 2010.

  9. 9.

    Especially the notability test: https://en.wikipedia.org/wiki/Wikipedia:Notability.

  10. 10.

    http://wiki.freebase.com/wiki/WEX.

  11. 11.

    http://www.wikidata.org.

  12. 12.

    http://wiki.dbpedia.org/DBpediaMobile.

  13. 13.

    http://www4.wiwiss.fu-berlin.de/flickrwrappr/.

  14. 14.

    http://graphics.stanford.edu/projects/vispedia/.

  15. 15.

    http://www.iais.fraunhofer.de/contentus.html.

  16. 16.

    For a screencast see: http://www.yovisto.com/labs/vissw2011/.

  17. 17.

    Deutsche Nationalbibliothek—DNB.

  18. 18.

    Personennamendatei—PND.

  19. 19.

    http://swse.deri.org/.

  20. 20.

    http://dbrec.net/.

  21. 21.

    http://revyu.com/.

  22. 22.

    http://stanbol.apache.org/.

  23. 23.

    http://www.opencalais.com/.

  24. 24.

    http://logd.tw.rpi.edu/demo/white-house-visit/search.

  25. 25.

    http://www.ae-info.org/.

  26. 26.

    http://faviki.com/.

  27. 27.

    If you want to do your own research, use Google Scholar and search for “DBpedia: a nucleus for a web of open data” and “DBpedia—a crystallization point for the Web of Data”. The two articles together received a remarkable 1,300 citations to date.

  28. 28.

    http://www.w3.org/TR/rdf-sparql-query/.

  29. 29.

    http://wiki.dbpedia.org/OnlineAccess.

  30. 30.

    http://virtuoso.openlinksw.com/.

  31. 31.

    More precisely it has a free version, besides the enterprise plan.

  32. 32.

    http://www.openlinksw.com/dataspace/dav/wiki/Main/VirtAWSPublicDataSets.

  33. 33.

    See literature on information extraction.

  34. 34.

    http://dbpedia.org/spotlight.

  35. 35.

    http://spotlight.dbpedia.org/gsoc/.

  36. 36.

    In the early stage of Sztakipedia project our team also developed a TinyMCE-based editor, but that is discontinued now.

  37. 37.

    tf-idf is a widely used statistical relevance measure. For details, see [23].

  38. 38.

    http://pedia.sztaki.hu/.

  39. 39.

    UIMA stands for Unstructured Information Management Architecture. It is a modular framework for annotating content. For more details, see http://uima.apache.org/.

  40. 40.

    http://lucene.apache.org.

  41. 41.

    These are the old and new machine interface standards supported by most library systems.

  42. 42.

    http://live.dbpedia.org/.

References

  1. Allemang D, Hendler JA (2008) Semantic web for the working ontologist: effective modeling in RDF and OWL. Morgan Kaufmann, San Mateo

    Google Scholar 

  2. Amin MS, Jamil H (2010) An efficient web-based wrapper and annotator for tabular data. Int J Softw Eng Knowl Eng 20(2):215

    Article  Google Scholar 

  3. Auer S, Bizer C, Kobilarov G, Lehmann J, Ives Z (2007) DBpedia: a nucleus for a web of open data. In: 6th international semantic web conference, Busan, Korea. Springer, Berlin, pp 11–15

    Google Scholar 

  4. Baader F (2003) The description logic handbook: theory, implementation, and applications. Cambridge University Press, Cambridge

    MATH  Google Scholar 

  5. Becker C, Bizer C (2008) DBpedia mobile: a location-enabled linked data browser. In: Linked data on the web (LDOW)

    Google Scholar 

  6. Bizer C, Lehmann J, Kobilarov G, Auer S, Becker C, Cyganiak R, Hellmann S (2009) DBpedia—a crystallization point for the web of data. Web Semant Sci Serv Agents World Wide Web 7(3):154–165. doi:10.1016/j.websem.2009.07.002

    Article  Google Scholar 

  7. Bollacker K, Evans C, Paritosh P, Sturge T, Taylor J (2008) Freebase: a collaboratively created graph database for structuring human knowledge. In: Proceedings of the 2008 ACM SIGMOD international conference on management of data, SIGMOD’08. ACM, New York, pp 1247–1250

    Chapter  Google Scholar 

  8. Chan B, Talbot J, Wu L, Sakunkoo N, Cammarano M, Hanrahan P (2009) Vispedia: on-demand data integration for interactive visualization and exploration. In: Proceedings of the 35th SIGMOD international conference on management of data. ACM, New York, pp 1139–1142

    Chapter  Google Scholar 

  9. Cyganiak R, Jentzsch A. Linking open data cloud diagram. http://lod-cloud.net/

  10. DiFranzo D, Graves A, Erickson JS, Ding L, Michaelis J, Lebo T, Patton E, Williams GT, Li X, Zheng JG et al. (2011) The web is my back-end: creating mashups with linked open government data. In: Linking government data, pp 205–219

    Chapter  Google Scholar 

  11. Ferrucci D, Lally A (2004) UIMA: an architectural approach to unstructured information processing in the corporate research environment. Nat Lang Eng 10(3–4):327–348

    Article  Google Scholar 

  12. Giles J (2005) Internet encyclopaedias go head to head. Nature 438(7070):900–901

    Article  Google Scholar 

  13. Heath T, Motta E (2008) Revyu: linking reviews and ratings into the web of data. Web Semant Sci Serv Agents World Wide Web 6(4):266–273

    Article  Google Scholar 

  14. Héder M (2011) Integrating artificial intelligence solutions into interfaces of online knowledge production. ICIC Express Lett 5(12):4395–4401

    Google Scholar 

  15. Héder M, Farkas M, Oláh T, Solt I (2011) Sztakipedia: mashing up natural language processing, recommender systems and search engines to support Wiki article editing. In: AI mashup challenge 2011 at extended semantic web conference (ESWC 2011), Iraklion, Crete, Greece

    Google Scholar 

  16. Hogan A, Harth A, Umbrich J, Kinsella S, Polleres A, Decker S (2011) Searching and browsing linked data with SWSE: the semantic web search engine. Web Semant Sci Serv Agents World Wide Web 9(4):365–401. doi:10.1016/j.websem.2011.06.004

    Article  Google Scholar 

  17. Jain P, Hitzler P, Yeh PZ, Verma K, Sheth AP (2010) Linked data is merely more data. In: Linked data meets artificial intelligence, pp 82–86

    Google Scholar 

  18. Kobilarov G, Scott T, Raimond Y, Oliver S, Sizemore C, Smethurst M, Bizer C, Lee R (2009) Media meets semantic web—how the BBC uses DBpedia and linked data to make connections. In: The semantic web: research and applications, pp 723–737

    Chapter  Google Scholar 

  19. Korica-Pehserl P, Latif A (2011) Meshing semantic web and Web 2.0 technologies to construct profiles: case study of Academia Europea members. In: Networked digital technologies, pp 334–344

    Chapter  Google Scholar 

  20. Mcguinness DL, Fikes R, Hendler J, Stein LA (2002) Daml+oil: an ontology language for the semantic web. IEEE Intell Syst 17(5):72–80

    Article  Google Scholar 

  21. Mendes PN, Jakob M, García-Silva A, Bizer C (2011) DBpedia spotlight: shedding light on the web of documents. In: Proceedings of the 7th international conference on semantic systems (I-semantics), pp 1–8. doi:10.1145/2063518.2063519

    Google Scholar 

  22. Passant A (2010) dbrec: music recommendations using DBpedia. In: Proceedings of the 9th international semantic web conference on the semantic web, ISWC’10, vol II. Springer, Berlin, pp 209–224

    Google Scholar 

  23. Salton G, Buckley C (1988) Term-weighting approaches in automatic text retrieval* 1. Inf Process Manag 24(5):513–523

    Article  Google Scholar 

  24. Waitelonis J, Osterhoff J, Sack H (2011) More than the sum of its parts: Contentus—a semantic multimodal search user interface. In: Proceedings of workshop on visual interfaces to the social and semantic web (VISSW), co-located with ACM IUI, vol 13

    Google Scholar 

  25. Wood A, Struthers K (2010) Pathology education, Wikipedia and the Net generation. Med Teacher 32(7):618

    Article  Google Scholar 

Download references

Acknowledgements

We are deeply indebted to Domonkos Tikk and Pablo Mendes for their valuable comments on the text. The authors were partially supported by the grant TÁMOP-4.2.2.B-10/1-2010-0009 of the Hungarian National Development Agency (NFÜ).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mihály Héder .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Héder, M., Solt, I. (2013). DBpedia Mashups. In: Endres-Niggemeyer, B. (eds) Semantic Mashups. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-36403-7_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-36403-7_4

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-36402-0

  • Online ISBN: 978-3-642-36403-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics