Abstract
A survey of the key approaches to the semantic processing of mathematical texts is presented. A software platform prototype for the electronic storage of mathematical documents, which is based on the linked open-data (LOD) model and uses semantic information for data management, including formula-fragment searching, is proposed. The analysis of mathematical documents and the extraction of semantic information from the latter are carried out based on the electronic collection of the Izv. Vyssh. Uchebn. Zaved., Mat. (1995–2009) using special-purpose ontologies, metadata representation in the RDF (Resource Description Framework) format, and integration with existing LOD sets.
Similar content being viewed by others
References
Bol’shakova, E.I., Klyshinskii, E.S., Lande, D.V., Noskov, A.A., Peskova, O.V., and Yagunova, E.V., Avtomaticheskaya obrabotka tekstov na estestvennom yazyke i komp’yuternaya lingvistika (Automatic Processing of Text on Natural Language and Computer Linguistics), Moscow: MIEM, 2011.
Elizarov, A.M., Lipachev, E.K., and Malakhal’tsev, M.A., Veb-tekhnologii dlya matematika. Osnovy MathML (Web-Technologies for Mathematician. Foundations of MathML), Moscow: Fizmatlit, 2010
Berners-Lee, T., Linked Data — Design Issues, 2006. http://www.w3.org/DesignIssues/LinkedData.html
Biryal’tsev, E.V., Elizarov, A.M., Zhil’tsov, N.G., Ivanov, V.V., Nevzorova, O.A., and Solov’ev, V.D., Model of semantic search in mathematic document collections on the basis of ontologies, Trudy 12-i vseross. nauchn. konf. “Elektron. bibliot.: perspekt. Metody i Tekhn., elektr. koll.” (Proc. 12th All-Russ. Sci. Conf. “Electronic Libraries: Perspective Methods and Technologies, Electronic Collections”), Kazan, 2010, pp. 296–300. http://rcdl.ru/doc/2010/296-300.pdf
Elizarov, A.M., Lipachev, E.K., and Malakhal’tsev, M.A., Services of electronic natural-scientific collections, constructed on the basis of MathML technologies, Proc. All-Russ. Supercomp. Conf. “Scientific Service in Internet: Supercomputer Centers and Tasks”, Moscow, Mos. Gos. Univ., 2010, pp. 533–534. http://agora.guru.ru/abrau2010/pdf/533.pdf
Kamareddine, F. and Wells, J.B., Computerizing mathematical text with MathLang, Electr. Notes Theor. Comput. Sci., 2008, vol. 205, pp. 5–30. http://www.sciencedirect.com/science/article/pii/S1571066108001680
Kohlhase, M., OMDoc-An open markup format for mathematical documents [version 1.2], Berlin: Springer-Verlag, 2006.
David, C., Kohlhase, M., Lange, C., Rabe, F., Zhiltsov, N., and Zholudev, V., Publishing math lecture notes as linked data, Proc. 7th Extended Semantic Web Conf. (ESWC), 2010, pp. 370–375. http://arxiv.org/pdf/1004.3390.pdf
Kohlhase, M., STeX: Semantic markup in TeX/LaTeX, 2005. https://svn.kwarc.info/repos/stex/trunk/sty/stex.pdf
Dobrov, B.V. and Lukashevich, N.V., Ontology on Natural Sciences and Technologies: Structure, composition and contemporary state, Elektron. Biblioteki. 2008, vol. 11, no. 1. http://www.elbib.ru/index.phtml?page=elbib/rus/journal/2008/part1/DL
Solovyev, V. and Zhiltsov, N., Logical structure analysis of scientific publications in mathematics, Proc. Int. Conf. Web Intelligence, Mining and Semantics (WIMS-11), New York, ACM, 2011, pp. 21.2–21.9.
Nevzorova, O., Zhiltsov, N., Zaikin, D., Zhibrik, O., Kirillovich, A., Nevzorov, V., and Birialtsev, E., Bringing Math to LOD: A semantic publishing platform prototype for scientific collections in mathematics, Proc. 12th Int. Semantic Web Conf., Sydney, Australia, 2013; Berlin: Springer-Verlag, 2013, vol. 8218, Part I, pp. 379–394. http://www.slideshare.net/NikitaZhiltsov/iswc-13talk
Lancu, M., Kohlhase, M., and Rabe, F., Translating the Mizar Mathematical Library into OMDoc format, Technical Report KWARC, Report-01/11, Bremen: Jacobs University, 2011. http://kwarc.info/publications/FRabe.html
Alama, J., Brink, K., Mamane, L., and Urban, J., Large formal Wikis: issues and solutions, Intelligent Computer Mathematics, Lecture Notes in Computer Science, 2011, vol. 6824. pp. 133–148. http://arxiv.org/pdf/1107.3209.pdf
Kohlhase, M., The Planetary project: Towards eMath 3.0, Proc. Conf. on Intelligent Computer Mathematics, Bremen, 2012, Jeuring, J., Campbell, J.A., Carette, J., Reis, G., Sojka, P., Wenzel, M., and Sorge, V., Eds., Berlin: Springer-Verlag, 2012. pp. 448–452. http://kwarc.info/kohlhase/submit/mkm12-planetary.pdf
Nevzorova, O.A., Zhil’tsov, N.G., Zaikin, D.A., Zhibrik, O.N., Kirillovich, A.V., Nevzorov, V.N., and Biryal’tsev, E.V., Prototype of program platform for publication of semantic data from mathematical scientific collections in LOD cloud, Uchen. Zap. Kazan. Univ. Ser. “Fiz.-mat. nauki” 2012, vol. 154,B. 3, pp. 216–232.
Nevzorova, O. and Nevzorov, V., The development support system “OntoIntegrator” for linguistic applications, Information Sci. Comp. 2009, vol. 3, no 13, pp. 78–84. http://foibg.com/ibs-isc/ibs-13/ibs-13-p11.pdf
Stamerjohanns, H., Kohlhase, M., Ginev, D., David, C., and Miller, B., Transforming large collections of scientific publications to XML, Mathem. Comp. Sci. 2010, vol. 3, pp. 299–307. http://kwarc.info/kohlhase/papers/mcs09.pdf
Schraefel, M., Shadbolt, N., and Gibbins, N., CS AKTive space: representing computer science on the Semantic Web, Proc. www, New York: ACM, 2004, pp. 384–392. http://eprints.soton.ac.uk/259084/1/p276-schraefel.pdf
Author information
Authors and Affiliations
Corresponding author
Additional information
Original Russian Text © E.V. Biryaltsev, A.M. Elizarov, N.G. Zhiltsov, E.K. Lipachev, O.A. Nevzorova, V.D. Solovyev, 2014, published in Nauchno-Tekhnicheskaya Informatsiya, Seriya 2, 2014, No. 4, pp. 12–17.
About this article
Cite this article
Biryaltsev, E.V., Elizarov, A.M., Zhiltsov, N.G. et al. Methods for analyzing semantic data of electronic collections in mathematics. Autom. Doc. Math. Linguist. 48, 81–85 (2014). https://doi.org/10.3103/S000510551402006X
Received:
Published:
Issue Date:
DOI: https://doi.org/10.3103/S000510551402006X