A Framework for Quality Assessment of Semantic Annotations of Tabular Data

Avogadro, Roberto; Cremaschi, Marco; Jiménez-Ruiz, Ernesto; Rula, Anisa

doi:10.1007/978-3-030-88361-4_31

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 12922))

Included in the following conference series:

International Semantic Web Conference

3269 Accesses
1 Altmetric

Abstract

Much information is conveyed within tables, which can be semantically annotated by humans or (semi)automatic approaches. Nevertheless, many applications cannot take full advantage of semantic annotations because of the low quality. A few methodologies exist for the quality assessment of semantic annotation of tabular data, but they do not automatically assess the quality as a multidimensional concept through different quality dimensions. The quality dimensions are implemented in STILTool 2, a web application to automate the quality assessment of the annotations. The evaluation is carried out by comparing the quality of semantic annotations with gold standards. The work presented here has been applied to at least three use cases. The results show that our approach can give us hints about the quality issues and how to address them.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
https://wiki.dbpedia.org/.
2.
http://webdatacommons.org/webtables/goldstandardV2.html table index: 14311244_0_7604843865524657408, 49801939_0_6964113429298874283.
3.
http://dbpedia.org/resource/Mountain.
4.
https://stiltool.disco.unimib.it/.
5.
https://www.djangoproject.com/.
6.
https://www.mongodb.com/.
7.
https://bitbucket.org/disco_unimib/stiltool/.
8.
https://hub.docker.com/repository/docker/cremarco/stiltool.
9.
http://www.haproxy.org/.
10.
https://docs.celeryproject.org/en/latest/userguide/workers.html.
11.
https://www.cs.ox.ac.uk/isg/challenges/sem-tab/2020/index.html.
12.
http://webdatacommons.org/webtables/goldstandardV2.html.
13.
http://www.cs.ox.ac.uk/isg/challenges/sem-tab/.
14.
https://en.wikipedia.org/wiki/Mont_Blanc_(Moon).

References

Cremaschi, M., Siano, A., Avogadro, R., Jimenez-Ruiz, E., Maurino, A.: STILTool: a semantic table interpretation evaLuation tool. In: Harth, A., et al. (eds.) ESWC 2020. LNCS, vol. 12124, pp. 61–66. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-62327-2_11
Chapter Google Scholar
Cutrona, V., Bianchi, F., Jiménez-Ruiz, E., Palmonari, M.: Tough tables: carefully evaluating entity linking for tabular data. In: Pan, J.Z., et al. (eds.) ISWC 2020. LNCS, vol. 12507, pp. 328–343. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-62466-8_21
Chapter Google Scholar
Debattista, J., Auer, S., Lange, C.: Luzzu-a methodology and framework for linked data quality assessment. JDIQ 8(1) (2016)
Google Scholar
Debattista, J., Lange, C., Auer, S., Cortis, D.: Evaluating the quality of the LOD cloud: an empirical investigation. SWJ 9(6), 859–901 (2018)
Google Scholar
Dimou, A., et al.: DBpedia mappings quality assessment. In: Poster & Demo at ISWC, vol. 1690. CEUR (2016)
Google Scholar
Dimou, A., et al.: Assessing and refining mappings to RDF to improve dataset quality. In: Arenas, M., et al. (eds.) ISWC 2015. LNCS, vol. 9367, pp. 133–149. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-25010-6_8
Chapter Google Scholar
Fetahu, B., Anand, A., Koutraki, M.: TableNet: an approach for determining fine-grained relations for Wikipedia tables. In: WWW 2019, pp. 2736–2742. ACM (2019)
Google Scholar
Hogan, A., Umbrich, J., Harth, A., Cyganiak, R., Polleres, A., Decker, S.: An empirical survey of linked data conformance. JWS 14, 14–44 (2012)
Article Google Scholar
Jiménez-Ruiz, E., Hassanzadeh, O., Efthymiou, V., Chen, J., Srinivas, K.: SemTab 2019: resources to benchmark tabular data to knowledge graph matching systems. In: Harth, A., et al. (eds.) ESWC 2020. LNCS, vol. 12123, pp. 514–530. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-49461-2_30
Chapter Google Scholar
Jimenéz-Ruiz, E., Hassanzadeh, O., Efthymiou, V., Chen, J., Srinivas, K., Cutrona, V.: Results of SemTab 2020. In: Proceedings of the Semantic Web Challenge on Tabular Data to Knowledge Graph Matching, vol. 2775, pp. 1–8 (2020)
Google Scholar
Junior, A.C., Debattista, J., O’Sullivan, D.: Assessing the quality of R2RML mappings. In: Joint Proceedings of the 1st Sem4Tra and the 1st AMAR at SEMANTiCS 2019), vol. 2447. CEUR (2019)
Google Scholar
Lehmberg, O., Ritze, D., Meusel, R., Bizer, C.: A large public corpus of web tables containing time and context metadata. In: WWW 2016, pp. 75–76. ACM (2016)
Google Scholar
Limaye, G., Sarawagi, S., Chakrabarti, S.: Annotating and searching web tables using entities, types and relationships. VLDB 3(1–2), 1338–1347 (2010)
Google Scholar
Moreau, B., Serrano-Alvarado, P.: Assessing the quality of RDF mappings with EvaMap. In: Harth, A., et al. (eds.) ESWC 2020. LNCS, vol. 12124, pp. 164–167. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-62327-2_28
Chapter Google Scholar
Paulheim, H.: Knowledge graph refinement: a survey of approaches and evaluation methods. Semant. Web 8(3), 489–508 (2017)
Article Google Scholar
Randles, A., Junior, A.C., O’Sullivan, D.: Towards a vocabulary for mapping quality assessment. In: The 15th OM at (ISWC 2020), vol. 2788, pp. 241–242 (2020)
Google Scholar
Rashid, M., Torchiano, M., Rizzo, G., Mihindukulasooriya, N., Corcho, O.: A quality assessment approach for evolving knowledge bases. SWJ 10(2), 349–383 (2019)
Google Scholar
Sejdiu, G., Rula, A., Lehmann, J., Jabeen, H.: A scalable framework for quality assessment of RDF datasets. In: Ghidini, C., et al. (eds.) ISWC 2019. LNCS, vol. 11779, pp. 261–276. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-30796-7_17
Chapter Google Scholar
Zaveri, A., Rula, A., Maurino, A., Pietrobon, R., Lehmann, J., Auer, S.: Quality assessment for linked data: a survey. SWJ 7(1), 63–93 (2016)
Google Scholar
Zhang, S., Meij, E., Balog, K., Reinanda, R.: Novel entity discovery from web tables. In: WWW 2020, pp. 1298–1308. ACM (2020)
Google Scholar
Zhang, Z.: Effective and efficient semantic table interpretation using tableminer+. SWJ 8(6), 921–957 (2017)
Google Scholar

Download references

Author information

Authors and Affiliations

University of Milano - Bicocca, Milano, Italy
Roberto Avogadro & Marco Cremaschi
City, University of London, London, UK
Ernesto Jiménez-Ruiz
University of Oslo, Oslo, Norway
Ernesto Jiménez-Ruiz
University of Brescia, Brescia, Italy
Anisa Rula

Authors

Roberto Avogadro
View author publications
You can also search for this author in PubMed Google Scholar
Marco Cremaschi
View author publications
You can also search for this author in PubMed Google Scholar
Ernesto Jiménez-Ruiz
View author publications
You can also search for this author in PubMed Google Scholar
Anisa Rula
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Marco Cremaschi .

Editor information

Editors and Affiliations

University of Würzburg, Würzburg, Germany
Andreas Hotho
Linköping University, Linköping, Sweden
Eva Blomqvist
University of Düsseldorf, Düsseldorf, Germany
Stefan Dietze
IBM Research - Thomas J. Watson Research, Hawthorne, CA, USA
Achille Fokoue
University of Texas, Austin, TX, USA
Ying Ding
Imperial College, London, UK
Payam Barnaghi
Australian National University, Canberra, ACT, Australia
Armin Haller
Fondazione Bruno Kessler, Povo, Trento, Italy
Mauro Dragoni
The Open University Walton Hall, Milton Keynes, UK
Harith Alani

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Avogadro, R., Cremaschi, M., Jiménez-Ruiz, E., Rula, A. (2021). A Framework for Quality Assessment of Semantic Annotations of Tabular Data. In: Hotho, A., et al. The Semantic Web – ISWC 2021. ISWC 2021. Lecture Notes in Computer Science(), vol 12922. Springer, Cham. https://doi.org/10.1007/978-3-030-88361-4_31

Download citation

DOI: https://doi.org/10.1007/978-3-030-88361-4_31
Published: 30 September 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-88360-7
Online ISBN: 978-3-030-88361-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

the Semantic Web Science Association (opens in a new tab)

A Framework for Quality Assessment of Semantic Annotations of Tabular Data