LOD Lab: Experiments at LOD Scale

Rietveld, Laurens; Beek, Wouter; Schlobach, Stefan

doi:10.1007/978-3-319-25010-6_23

Laurens Rietveld²⁴,
Wouter Beek²⁴ &
Stefan Schlobach²⁴

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9367))

Included in the following conference series:

International Semantic Web Conference

2508 Accesses
11 Citations

Abstract

Contemporary Semantic Web research is in the business of optimizing algorithms for only a handful of datasets such as DBpedia, BSBM, DBLP and only a few more. This means that current practice does not generally take the true variety of Linked Data into account. With hundreds of thousands of datasets out in the world today the results of Semantic Web evaluations are less generalizable than they should and — this paper argues — can be. This paper describes LOD Lab: a fundamentally different evaluation paradigm that makes algorithmic evaluation against hundreds of thousands of datasets the new norm. LOD Lab is implemented in terms of the existing LOD Laundromat architecture combined with the new open-source programming interface Frank that supports Web-scale evaluations to be run from the command-line. We illustrate the viability of the LOD Lab approach by rerunning experiments from three recent Semantic Web research publications and expect it will contribute to improving the quality and reproducibility of experimental work in the Semantic Web community. We show that simply rerunning existing experiments within this new evaluation paradigm brings up interesting research questions as to how algorithmic performance relates to (structural) properties of the data.

Download to read the full chapter text

Chapter PDF

LOD Lab: Scalable Linked Data Processing

A Collection of Benchmark Datasets for Systematic Evaluations of Machine Learning on the Semantic Web

On the Status of Experimental Research on the Semantic Web

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Bazoobandi, H.R., de Rooij, S., Urbani, J., ten Teije, A., van Harmelen, F., Bal, H.: A compact in-memory dictionary for RDF data. In: Gandon, F., Sabou, M., Sack, H., d’Amato, C., Cudré-Mauroux, P., Zimmermann, A. (eds.) ESWC 2015. LNCS, vol. 9088, pp. 205–220. Springer, Heidelberg (2015)
Chapter Google Scholar
Beek, W., Rietveld, L.: Frank: The lod cloud at your fingertips. In: Developers Workshop, ESWC (2015)
Google Scholar
Beek, W., Rietveld, L., Bazoobandi, H.R., Wielemaker, J., Schlobach, S.: LOD laundromat: a uniform way of publishing other people’s dirty data. In: Mika, P., Tudorache, T., Bernstein, A., Welty, C., Knoblock, C., Vrandečić, D., Groth, P., Noy, N., Janowicz, K., Goble, C. (eds.) ISWC 2014, Part I. LNCS, vol. 8796, pp. 213–228. Springer, Heidelberg (2014)
Google Scholar
Bizer, C., Schultz, A.: The berlin sparql benchmark (2009)
Google Scholar
Boncz, P., Fundulaki, I., Gubichev, A., Larriba-Pey, J., Neumann, T.: The linked data benchmark council project. Datenbank-Spektrum 13(2), 121–129 (2013)
Article Google Scholar
Buil-Aranda, C., Hogan, A., Umbrich, J., Vandenbussche, P.-Y.: SPARQL web-querying infrastructure: ready for action? In: Alani, H., Kagal, L., Fokoue, A., Groth, P., Biemann, C., Parreira, J.X., Aroyo, L., Noy, N., Welty, C., Janowicz, K. (eds.) ISWC 2013, Part II. LNCS, vol. 8219, pp. 277–293. Springer, Heidelberg (2013)
Chapter Google Scholar
Euzenat, J., Meilicke, C., Stuckenschmidt, H., Shvaiko, P., Trojahn, C.: Ontology alignment evaluation initiative: six years of experience. In: Spaccapietra, S. (ed.) Journal on Data Semantics XV. LNCS, vol. 6720, pp. 158–192. Springer, Heidelberg (2011)
Chapter Google Scholar
Fernández, J.D., Martínez-Prieto, M.A., Gutiérrez, C., Polleres, A., Arias, M.: Binary rdf representation for publication and exchange (hdt). Web Semantics: Science, Services and Agents on the World Wide Web 19, 22–41 (2013)
Article Google Scholar
Guo, Y., Pan, Z., Heflin, J.: Lubm: A benchmark for owl knowledge base systems. Web Semantics: Science, Services and Agents on the World Wide Web 3(2), 158–182 (2005)
Article Google Scholar
Harris, S., Seaborne, A.: SPARQL 1.1 query language, March 2013
Google Scholar
Harth, A.: Billion Triples Challenge data set. Downloaded from (2012). http://km.aifb.kit.edu/projects/btc-2012/
Hogan, A., Umbrich, J., Harth, A., Cyganiak, R., Polleres, A., Decker, S.: An Empirical Survey of Linked Data Conformance. Web Semantics: Science, Services and Agents on the World Wide Web 14, 14–44 (2012)
Article Google Scholar
Isele, R., Umbrich, J., Bizer, C., Harth, A.: Ldspider: an open-source crawling framework for the web of linked data. In: 9th International Semantic Web Conference (ISWC 2010). Citeseer (2010)
Google Scholar
Rietveld, L., Beek, W., Schlobach, S.: LOD in a box: The C-LOD meta-dataset (Under submission). http://www.semantic-web-journal.net/system/files/swj868.pdf
Rietveld, L., Verborgh, R., Beek, W., Vander Sande, M., Schlobach, S.: Linked data-as-a-service: the semantic web redeployed. In: Gandon, F., Sabou, M., Sack, H., d’Amato, C., Cudré-Mauroux, P., Zimmermann, A. (eds.) ESWC 2015. LNCS, vol. 9088, pp. 471–487. Springer, Heidelberg (2015)
Chapter Google Scholar
Schmachtenberg, M., Bizer, C., Paulheim, H.: Adoption of the linked data best practices in different topical domains. In: Mika, P., Tudorache, T., Bernstein, A., Welty, C., Knoblock, C., Vrandečić, D., Groth, P., Noy, N., Janowicz, K., Goble, C. (eds.) ISWC 2014, Part I. LNCS, vol. 8796, pp. 245–260. Springer, Heidelberg (2014)
Google Scholar
Schmidt, M., Hornung, T., Lausen, G., Pinkel, C.: Sp 2 bench: A sparql performance benchmark, icde, Shanghai, China (2009)
Google Scholar
Schmidt, M., Görlitz, O., Haase, P., Ladwig, G., Schwarte, A., Tran, T.: FedBench: a benchmark suite for federated semantic data query processing. In: Aroyo, L., Welty, C., Alani, H., Taylor, J., Bernstein, A., Kagal, L., Noy, N., Blomqvist, E. (eds.) ISWC 2011, Part I. LNCS, vol. 7031, pp. 585–600. Springer, Heidelberg (2011)
Chapter Google Scholar
Verborgh, R., et al.: Querying datasets on the web with high availability. In: Mika, P., et al. (eds.) ISWC 2014, Part I. LNCS, vol. 8796, pp. 180–196. Springer, Heidelberg (2014)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, VU University Amsterdam, Amsterdam, The Netherlands
Laurens Rietveld, Wouter Beek & Stefan Schlobach

Authors

Laurens Rietveld
View author publications
You can also search for this author in PubMed Google Scholar
Wouter Beek
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Schlobach
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Laurens Rietveld .

Editor information

Editors and Affiliations

Pontificia Universidad Católica de Chile, Santiago de Chile, Chile
Marcelo Arenas
Universidad Politecnica de Madrid, Boadilla del Monte, Spain
Oscar Corcho
University of Southampton, Southampton, United Kingdom
Elena Simperl
Sozialwissenschaften eV, GESIS Leibniz-Institut, Köln, Nordrhein-Westfalen, Germany
Markus Strohmaier
The Open University, Milton Keynes, United Kingdom
Mathieu d'Aquin
IBM Research, Yorktown Heights, New York, USA
Kavitha Srinivas
Elsevier Labs., Amsterdam, The Netherlands
Paul Groth
School of Medicine, Stanford University, Stanford, California, USA
Michel Dumontier
Lehigh University, Bethlehem, Pennsylvania, USA
Jeff Heflin
DAYTON, Ohio, USA
Krishnaprasad Thirunarayan
University of Koblenz-Landau, Koblenz, Rheinland-Pfalz, Germany
Steffen Staab

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Rietveld, L., Beek, W., Schlobach, S. (2015). LOD Lab: Experiments at LOD Scale. In: Arenas, M., et al. The Semantic Web - ISWC 2015. ISWC 2015. Lecture Notes in Computer Science(), vol 9367. Springer, Cham. https://doi.org/10.1007/978-3-319-25010-6_23

Download citation

DOI: https://doi.org/10.1007/978-3-319-25010-6_23
Published: 24 October 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25009-0
Online ISBN: 978-3-319-25010-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

LOD Lab: Experiments at LOD Scale

Abstract

Chapter PDF

Similar content being viewed by others

LOD Lab: Scalable Linked Data Processing

A Collection of Benchmark Datasets for Systematic Evaluations of Machine Learning on the Semantic Web

On the Status of Experimental Research on the Semantic Web

Keywords

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

LOD Lab: Experiments at LOD Scale

Abstract

Chapter PDF

Similar content being viewed by others

LOD Lab: Scalable Linked Data Processing

A Collection of Benchmark Datasets for Systematic Evaluations of Machine Learning on the Semantic Web

On the Status of Experimental Research on the Semantic Web

Keywords

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation