skip to main content
10.1145/2034691.2034703acmconferencesArticle/Chapter ViewAbstractPublication PagesdocengConference Proceedingsconference-collections
demonstration

The art of mathematics retrieval

Published:19 September 2011Publication History

ABSTRACT

The design and architecture of MIaS (Math Indexer and Searcher), a system for mathematics retrieval is presented, and design decisions are discussed. We argue for an approach based on Presentation MathML using a similarity of math subformulae. The system was implemented as a math-aware search engine based on the state-of-the-art system Apache Lucene.

Scalability issues were checked against more than 400,000 arXiv documents with 158 million mathematical formulae. Almost three billion MathML subformulae were indexed using a Solr-compatible Lucene.

References

  1. \c S. Anca. Natural Language and Mathematics Processing for Applicable Theorem Search. Master's thesis, Jacobs University, Bremen, Aug. 2009. https://svn.eecs.jacobs-university.de/svn/eecs/archive/msc-2009/aanca.pdf.Google ScholarGoogle Scholar
  2. D. Archambault and V. Moco. Canonical MathML to Simplify Conversion of MathML to Braille Mathematical Notations. In K. Miesenberger, J. Klaus, W. Zagler, and A. Karshmer, editors, Computers Helping People with Special Needs, volume 4061 of Lecture Notes in Computer Science, pages 1191--1198. Springer Berlin / Heidelberg, 2006. http://dx.doi.org/10.1007/11788713_172. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. M. Líaka. Vyhledávání v matematickém textu (in Slovak), Searching Mathematical Texts, 2010. Bachelor Thesis, Masaryk University, Brno, Faculty of Informatics (advisor: Petr Sojka), https://is.muni.cz/th/255768/fi_b/?lang=en.Google ScholarGoogle Scholar
  4. M. Líaka, P. Sojka, M. R°u~icka, and P. Mravec. Web Interface and Collection for Mathematical Retrieval. In P. Sojka and T. Bouche, editors, Proceedings of DML 2011, pages 77--84, Bertinoro, Italy, July 2011. Masaryk University. http://www.fi.muni.cz/ sojka/dml-2011-program.html.Google ScholarGoogle Scholar
  5. J. Miautka and L. Galamboa. Extending Full Text Search Engine for Mathematical Content. In P. Sojka, editor, Proceedings of DML 2008, pages 55--67, Birmingham, UK, July 2008. Masaryk University. http://dml.cz/dmlcz/702546.Google ScholarGoogle Scholar
  6. R. Munavalli and R. Miner. MathFind: A Math-Aware Search Engine. In Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR,'06, pages 735--735, New York, NY, USA, 2006. ACM. http://doi.acm.org/10.1145/1148170.1148348. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. P. Sojka and M. Líaka. Indexing and Searching Mathematics in Digital Libraries -- Architecture, Design and Scalability Issues. In J. H. Davenport, W.M. Farmer, J. Urban and F. Rabe, editors, Proceedings of CICM Conference 2011 (Calculemus/MKM), volume 6824 of Lecture Notes in Artificial Intelligence, LNAI, pages 228--243, Berlin, Germany, July 2011. Springer\discretionary-Verlag. http://dx.doi.org/10.1007/978-3-642-22673-1_16. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. H. Stamerjohanns, M. Kohlhase, D. Ginev, C. David, and B. Miller. Transforming Large Collections of Scientific Publications to XML. Mathematics in Computer Science, 3:299--307, 2010. http://dx.doi.org/10.1007/s11786-010-0024-7.Google ScholarGoogle Scholar
  9. W. Sylwestrzak, J. Borbinha, T. Bouche, A. Nowinski, and P. Sojka. EuDML--Towards the European Digital Mathematics Library. In P. Sojka, editor, Proceedings of DML 2010, pages 11--24, Paris, France, July 2010. Masaryk University. http://dml.cz/dmlcz/702569.Google ScholarGoogle Scholar

Index Terms

  1. The art of mathematics retrieval

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          DocEng '11: Proceedings of the 11th ACM symposium on Document engineering
          September 2011
          296 pages
          ISBN:9781450308632
          DOI:10.1145/2034691

          Copyright © 2011 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 19 September 2011

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • demonstration

          Acceptance Rates

          Overall Acceptance Rate178of537submissions,33%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader