Skip to main content

Harmonizing Big Data with a Knowledge Graph: OceanGraph KG Uses Case

  • Conference paper
  • First Online:
Cloud Computing, Big Data & Emerging Topics (JCC-BD&ET 2020)

Abstract

In this paper we introduce recent efforts carried out by the OceanGraph KG project to integrate semi-structured or unstructured content. We present some of the practical applications of OceanGraph through use cases, and finally summarize the lessons learned during the development process.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://eshorizonte2020.es/.

  2. 2.

    https://googleblog.blogspot.com/2012/05/introducing-knowledge-graph-things-not.html.

  3. 3.

    http://www.datosdelmar.mincyt.gob.ar/index.php.

  4. 4.

    https://www.gbif.org/.

  5. 5.

    http://www.iobis.org/.

  6. 6.

    https://www.springernature.com/gp/researchers/scigraph.

  7. 7.

    https://www.w3.org/2004/02/skos/.

  8. 8.

    http://www.opengeospatial.org/.

  9. 9.

    https://github.com/oborel/obo-relations.

  10. 10.

    http://silkframework.org/.

  11. 11.

    https://terms.tdwg.org/wiki/dwc:recordedBy.

  12. 12.

    http://graphdb.ontotext.com/.

  13. 13.

    http://web.cenpat-conicet.gob.ar:7200/sparql?savedQueryName=OG-Q001.

References

  1. Malik, T., Foster, I.: Addressing data access needs of the long-tail distribution of geoscientists. In: 2012 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), pp. 5348–5351. IEEE (2012)

    Google Scholar 

  2. Hardisty, A., Roberts, D.: A decadal view of biodiversity informatics: challenges and priorities. BMC Ecol. 13(1), 16 (2013). https://doi.org/10.1186/1472-6785-13-16

    Article  Google Scholar 

  3. Beyer, M.A., Laney, D.: The importance of “big data”: a definition, pp. 2014–2018. Gartner, Stamford, CT (2012)

    Google Scholar 

  4. Liu, Y., Qiu, M., Liu, C., Guo, Z.: Big data in ocean observation: opportunities and challenges. In: Wang, Y., Yu, G., Zhang, Y., Han, Z., Wang, G. (eds.) BigCom 2016. LNCS, vol. 9784, pp. 212–222. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-42553-5_18

    Chapter  Google Scholar 

  5. Campbell, P.: Data’s shameful neglect. Nature 461(7261), 145 (2009)

    Google Scholar 

  6. Lomotey, R.K., Deters, R.: Terms extraction from unstructured data silos. In: 2013 8th International Conference on System of Systems Engineering (SoSE), pp. 19–24. IEEE (2013)

    Google Scholar 

  7. Berners-Lee, T., Hendler, J., Lassila, O., et al.: The semantic web. Sci. Am. 284(5), 28–37 (2001)

    Article  Google Scholar 

  8. Bizer, C., Heath, T., Berners-Lee, T.: Linked data: the story so far. In: Semantic Services, Interoperability and Web Applications: Emerging Concepts, pp. 205–227. IGI Global (2011)

    Google Scholar 

  9. Bukhari, S.A.C., Nagy, M.L., Ciccarese, P., Krauthammer, M., Baker, C.J.: iCyrus: a semantic framework for biomedical image discovery. In: SWAT4LS, pp. 13–22 (2015)

    Google Scholar 

  10. Bukhari, S.A.C.: Semantic enrichment and similarity approximation for biomedical sequence images. Ph.D. thesis, University of New Brunswick (Canada) (2017)

    Google Scholar 

  11. Ehrlinger, L.,Wöß, W.: Towards a definition of knowledge graphs. In: SEMANTiCS (Posters, Demos, SuCCESS), vol. 48 (2016)

    Google Scholar 

  12. Ceravolo, P., et al.: Big data semantics. J. Data Semant. 7(2), 65–85 (2018). https://doi.org/10.1007/s13740-018-0086-2

    Article  Google Scholar 

  13. Leadbetter, A., Arko, R., Chandler, C., Shepherd, A., Lowry, R.: Linked data an oceanographic perspective. J. Ocean Technol. 8(3), 7–12 (2013)

    Google Scholar 

  14. Leadbetter, A., Lowry, R., Clements, D.O.: The NERC vocabulary server: version 2.0. In: Geophysical Research Abstracts, vol. 14 (2012)

    Google Scholar 

  15. Krisnadhi, A., et al.: An ontology pattern for oceanographic cruises: towards an oceanographer’s dream of integrated knowledge discovery (2014)

    Google Scholar 

  16. Cheatham, M., et al.: The GeoLink knowledge graph. Big Earth Data 2(2), 131–143 (2018)

    Article  Google Scholar 

  17. Page, R.D.M.: Ozymandias: a biodiversity knowledge graph. PeerJ 7, e6739 (2019)

    Article  Google Scholar 

  18. Springer Nature SciGraph (2018). http://www.springernature.com/gp/researchers/scigraph. Accessed 24 Jan 2019

  19. Zárate, M., Rosales, P., Fillottrani, P., Delrieux, C., Lewis, M.:Oceanographic data management: towards the publishing of Pampa Azul oceanographic campaigns as linked data. In: Proceedings of the 12th Alberto Mendelzon International Workshop on Foundations of Data Management, AMW 2018 (2018)

    Google Scholar 

  20. Zárate, M., Rosales, P., Braun, G., Lewis, M., Fillottrani, P.R., Delrieux, C.: OceanGraph: some initial steps toward a oceanographic knowledge graph. In: Villazón-Terrazas, B., Hidalgo-Delgado, Y. (eds.) KGSWC 2019. CCIS, vol. 1029, pp. 33–40. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-21395-4_3

    Chapter  Google Scholar 

  21. The Integrated Ocean Observing System (IOOS) (2013). https://ioos.noaa.gov/. Accessed 19 July 2019

  22. Battle, R., Kolas, D.: Enabling the geospatial semantic web with parliament and GeoSPARQL. Semant. Web 3(4), 355–370 (2012)

    Article  Google Scholar 

  23. SPARQL query language for RDF (2008). https://www.w3.org/TR/rdf-sparql-protocol/. Accessed 10 Mar 2019

  24. Wieczorek, J., et al.: Darwin Core: an evolving community-developed biodiversity data standard. PLoS One 7(1), e29715 (2012)

    Article  Google Scholar 

  25. Zárate, M., Braun, G., Fillottrani, P.R., Delrieux, C., Lewis, M.: BiGe-Onto: an ontology-based system for managing biodiversity and biogeography data. Appl. Ontol. J. (2019, accepted paper)

    Google Scholar 

  26. Buttigieg, P.L., Pafilis, E., Lewis, S.E., Schildhauer, M.P., Walls, R.L., Mungall, C.J.: The environment ontology in 2016: bridging domains with increased scope, semantic density, and interoperation. J. Biomed. Semant. 7, 57 (2016). https://doi.org/10.1186/s13326-016-0097-6

    Article  Google Scholar 

  27. W3C: Semantic Sensor Network Ontology (SSN) W3C Recommendation (2017)

    Google Scholar 

  28. Zárate, M., Buckle, C., Mazzanti, R., Fillottrani, P., Delrieux, C., Lewis, M.: OceanGraph RDF dataset (2020). https://doi.org/10.17632/9t5xkt9wwk.1. Accessed 18 Mar 2019

  29. Michele Pasin and FigShare Admin SN SciGraph. Dataset: Persons, April 2019

    Google Scholar 

  30. Time Ontology in OWL W3C Recommendation 19 October 2017 (2017). https://www.w3.org/TR/owl-time/. Accessed 27 Jan 2020

  31. Bikakis, N., Sellis, T.: Exploration and visualization in the web of big linked data: a survey of the state of the art. arXiv preprint arXiv:1601.08059 (2016)

  32. Hernández-Pérez, T.: In the age of the web of data: first open data, then big data. El profesional de la información (EPI) 25(4), 517–525 (2016)

    Article  Google Scholar 

  33. Hendler, J.: Broad data: exploring the emerging web of data. Big Data 1(1), 18–20 (2013)

    Article  MathSciNet  Google Scholar 

  34. Manyika, J.: Big data: the next frontier for innovation, competition, and productivity (2011). http://www.mckinsey.com/Insights/MGI/Research/Technology_and_Innovation/Big_data_The_next_frontier_for_innovation. Accessed 29 Jan 2020

Download references

Acknowledgments

This work is partially funded by project Linked Open Data Platform for Management and Visualization of Primary Data in Marine Science. Supported by Secretariat of Science and Technology of the National University of Patagonia San Juan Bosco (UNPSJB). Some of the data used were provided by the Golfo San Jorge Research and Transfer Center (CIT-GSJ-CONICET).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Marcos Zárate .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zárate, M., Buckle, C., Mazzanti, R., Lewis, M., Fillottrani, P., Delrieux, C. (2020). Harmonizing Big Data with a Knowledge Graph: OceanGraph KG Uses Case. In: Rucci, E., Naiouf, M., Chichizola, F., De Giusti, L. (eds) Cloud Computing, Big Data & Emerging Topics. JCC-BD&ET 2020. Communications in Computer and Information Science, vol 1291. Springer, Cham. https://doi.org/10.1007/978-3-030-61218-4_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-61218-4_6

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-61217-7

  • Online ISBN: 978-3-030-61218-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics