Abstract
The improvement and diversification of experimental technologies have caused a flood of data. In order to share and integrate such huge and diverse data, it is important to describe the relationship between data using Semantic Web technology. A goal of the Semantic Web is that computers can automatically process data by linking meaningful data and by forming a web of data. The Semantic Web consists of key technologies such as Resource Description Framework (RDF), ontologies, triple stores (database for RDF), and SPARQL Protocol and RDF Query Language (SPARQL), which is a query language for triple stores. Although the Semantic Web has been used by some specific domains such as government and media, recently it is also applied to the life sciences. In this chapter, I will describe about the Semantic Web and its application to life science including glycobiology. Finally, I will introduce TogoTable, which is a web application using the Semantic Web, used for collecting annotations from distributed databases.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Aoki-Kinoshita KF (2009) KCF format. In: Glycome informatics: methods and applications. Chapman and Hall/CRC, Boca Raton, pp 31–32
Aoki-Kinoshita KF, Bolleman J, Campbell MP et al (2013a) Introducing glycomics data into the Semantic Web. J Biomed Semant 4:39
Aoki-Kinoshita KF, Sawaki H, An HJ et al (2013b) The third ACGG-DB meeting report: towards an international collaborative infrastructure for glycobioinformatics. Glycobiology 23:144–146
Banin E, Neuberger Y, Altshuler Y et al (2002) A novel linear code® nomenclature for complex carbohydrates. Trends Glycosci Glycotechnol 14:127–137
Belleau F, Nolin MA, Tourigny N et al (2008) Bio2RDF: towards a mashup to build bioinformatics knowledge systems. J Biomed Inform 41:706–716
Berners-Lee T, Hendler J, Lassila O (2001) The Semantic Web. Sci Am 284:28–37
Bohne-Lang A, Lang E, Förster T et al (2001) LINUCS: linear notation for unique description of carbohydrate sequences. Carbohydr Res 336:1–11
Bushman B, Anderson D, Fu G (2015) Transforming the medical subject headings into linked data: creating the authorized version of MeSH in RDF. J Libr Metadata 15:157–176
Campbell MP, Peterson R, Mariethoz J et al (2014) UniCarbKB: building a knowledge platform for glycoproteomics. Nucleic Acids Res 42:D215–D221
Chiba H, Nishide H, Uchiyama I (2015) Construction of an ortholog database using the semantic web technology for integrative analysis of genomic data. PLoS One 10:e0122802
Doubet S, Albersheim P (1992) CarbBank. Glycobiology 2:505
Egorova KS, Toukach PV (2014) Expansion of coverage of Carbohydrate Structure Database (CSDB). Carbohydr Res 389:112–114
Erling O, Mikhailov I (2009) RDF support in the virtuoso DBMS. In: Tassilo Pellegrini T, Auer S, Tochtermann K et al (eds) Networked knowledge – networked media. Springer, Berlin/Heidelberg, pp 7–24
Faulconbridge A, Burdett T, Brandizi M et al (2014) Updates to BioSamples database at European bioinformatics institute. Nucleic Acids Res 42:D50–D52
Fu G, Batchelor C, Dumontier M et al (2015) PubChemRDF: towards the semantic annotation of PubChem compound and substance databases. J Cheminform 7:34
Hashimoto K, Goto S, Kawano S et al (2006) KEGG as a glycome informatics resource. Glycobiology 16:63R–70R
IUPAC-IUB Joint Commission on Biochemical Nomenclature (JCBN) (1982) Abbreviated terminology of oligosaccharide chains. Recommendations 1980. J Bio Chem 257:3347–3351
Jupp S, Malone J, Bolleman J et al (2014) The EBI RDF platform: linked open data for the life sciences. Bioinformatics 30:1338–1339
Kaji H, Shikanai T, Sasaki-Sawa A et al (2012) Large-scale identification of N-glycosylated proteins of mouse tissues and construction of a glycoprotein database, GlycoProtDB. J Proteome Res 11:4553–4566
Katayama T, Arakawa K, Nakao M et al (2010) The DBCLS BioHackathon: standardization and interoperability for bioinformatics web services and workflows. J Biomed Semant 1:8
Katayama T, Wilkinson MD, Vos R et al (2011) The 2nd DBCLS BioHackathon: interoperable bioinformatics web services for integrated applications. J Biomed Semant 2:4
Katayama T, Wilkinson MD, Micklem G et al (2013) The 3rd DBCLS BioHackathon: improving life science data integration with Semantic Web technologies. J Biomed Semant 4:6
Katayama T, Wilkinson MD, Aoki-Kinoshita KF et al (2014) BioHackathon series in 2011 and 2012: penetration of ontology and linked data in life science domains. J Biomed Semant 5:5
Kawano S, Watanabe T, Mizuguchi S et al (2014) TogoTable: cross-database annotation system using the Resource Description Framework (RDF) data model. Nucleic Acids Res 42:W442–W448
Kinjo AR, Suzuki H, Yamashita R et al (2012) Protein Data Bank Japan (PDBj): maintaining a structural data archive and resource description framework format. Nucleic Acids Res 40:D453–D460
Kobilarov G, Scott T, Raimond Y et al (2009) Media meets semantic web–how the bbc uses dbpedia and linked data to make connections. In: Aroyo L, Traverso P, Ciravegna F et al (eds) The semantic web: research and applications. Springer, Berlin/Heidelberg, pp 723–737
Okuda S, Nakao H, Kawasaki T (2014) GlycoEpitope: database for carbohydrate antigen and antibody. In: Taniguchi N, Endo T, Hart GW et al (eds) Glycoscience: biology and medicine. Springer, Tokyo, pp 267–273
Ranzinger R, Herget S, von der Lieth CW et al (2011) GlycomeDB-a unified database for carbohydrate structures. Nucleic Acids Res 39:D373–D376
Ranzinger R, Aoki-Kinoshita KF, Campbell MP et al (2015) GlycoRDF: an ontology to standardize glycomics data in RDF. Bioinfomatics 31:919–925
Shadbolt N, O’Hara K, Berners-Lee T et al (2012) Linked open government data: lessons from data.gov.uk. IEEE Intell Syst 27:16–24
Tanaka K, Aoki-Kinoshita KF, Kotera M et al (2014) WURCS: the Web3 unique representation of carbohydrate structures. J Chem Inf Model 54:1558–1566
The Gene Ontology Consortium, Ashburner M, Ball CA et al (2000) Gene ontology: tool for the unification of biology. Nat Genet 25:25–29
The UniProt Consortium (2014) Activities at the Universal Protein Resource (UniProt). Nucleic Acids Res 42:D191–D198
Willighagen EL, Waagmeester A, Spjuth O et al (2013) The ChEMBL database as linked open data. J Cheminform 5:23
Wimalaratne SM, Grenon P, Hermjakob H et al (2014) BioModels linked dataset. BMC Syst Biol 8:91
Wimalaratne SM, Bolleman J, Juty N et al (2015) SPARQL-enabled identifier conversion with Identifiers.org. Bioinformatics 31:1875–1877
Wu H, Fujiwara T, Yamamoto Y et al (2014) BioBenchmark Toyama 2012: an evaluation of the performance of triple stores on biological data. J Biomed Semant 5:32
York WS, Agravat S, Aoki-Kinoshita KF et al (2014) MIRAGE: the minimum information required for a glycomics experiment. Glycobiology 24:402–406
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer Japan
About this chapter
Cite this chapter
Kawano, S. (2017). Glycobiology Meets the Semantic Web. In: Aoki-Kinoshita, K. (eds) A Practical Guide to Using Glycomics Databases. Springer, Tokyo. https://doi.org/10.1007/978-4-431-56454-6_17
Download citation
DOI: https://doi.org/10.1007/978-4-431-56454-6_17
Published:
Publisher Name: Springer, Tokyo
Print ISBN: 978-4-431-56452-2
Online ISBN: 978-4-431-56454-6
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)