Benchmarking RDF Query Engines and Instance Matching Systems

  • Chapter
Linked Data

Abstract

Standards and benchmarking have traditionally been the main tools for formally defining, and demonstrating, how adequately systems address new challenges. In this chapter, we discuss benchmarks for RDF query engines and instance matching systems. In practice, benchmarks inform users of the strengths and weaknesses of competing tools and approaches; more importantly, they encourage the advancement of technology by giving both academia and industry clear targets for performance and functionality.

Copyright information

© 2018 Springer International Publishing AG

About this chapter

Cite this chapter

Sakr, S., Wylot, M., Mutharaju, R., Le Phuoc, D., Fundulaki, I. (2018). Benchmarking RDF Query Engines and Instance Matching Systems. In: Linked Data. Springer, Cham. https://doi.org/10.1007/978-3-319-73515-3_7

  • DOI: https://doi.org/10.1007/978-3-319-73515-3_7

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-73514-6

  • Online ISBN: 978-3-319-73515-3

  • eBook Packages: Computer Science (R0)
