skip to main content
10.1145/3151509.3151526acmotherconferencesArticle/Chapter ViewAbstractPublication PageshipConference Proceedingsconference-collections
research-article

Mass Digitization of Archival Documents using Mobile Phones

Published:10 November 2017Publication History

ABSTRACT

Digital copies of historical documents are needed for the Digital Humanities. Currently, cameras of standard mobile phones are able to capture documents with a resolution of about 330 dpi for document sizes up to DIN A4 (German standard, 297 x 210 mm), which allows a digitization of documents using a standard device. Thus, scholars are able to take images of documents in archives themselves without the need of book scanners or other devices. This paper presents a scanning app, which comprises a real time page detection, quality assessment (focus measure) and an automated detection of a page turn over if books are scanned. Additionally, a portable device - the ScanTent - to place the mobile phone during scanning is presented. The page detection is evaluated on the ICDAR2015 SmartDoc competition dataset and shows a reliable page detection with an average Jaccard index of 75%.

References

  1. JF Brenner, BS Dew, and JB Horton. 1976. An automated microscope for cytologic research. J Histochen Cytochem 24 (1976).Google ScholarGoogle Scholar
  2. J. Burie, J. Chazalon, M. Coustaty, S. Eskenazi, M. M. Luqman, M. Mehri, N. Nayef, J. Ogier, S. Prum, and M. Rusiñol. 2015. ICDAR2015 competition on smartphone document capture and OCR (SmartDoc). In 13th International Conference on Document Analysis and Recognition (ICDAR). 1161--1165. https://doi.org/10.1109/ICDAR.2015.7333943 Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Fototechnischer Ausschuss der KLA. 2016. Wirtschaftliche Digitalisierung in Archiven. (2016).Google ScholarGoogle Scholar
  4. Lawrence Firestone, Kitty Cook, Kevin Culp, Neil Talsania, and Kendall Preston. 1991. Comparison of autofocus methods for automated microscopy. Cytometry 12, 3 (1991), 195--206. https://doi.org/10.1002/cyto.990120302 Google ScholarGoogle ScholarCross RefCross Ref
  5. J. Kumar, F. Chen, and D. Doermann. 2012. Sharpness estimation for document and scene images. In Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012). 3292--3295.Google ScholarGoogle Scholar
  6. J Matas, O Chum, M Urban, and T Pajdla. 2004. Robust wide-baseline stereo from maximally stable extremal regions. Image and Vision Computing 22, 10 (2004), 761--767. https://doi.org/10.1016/j.imavis.2004.02.006 British Machine Vision Computing 2002. Google ScholarGoogle ScholarCross RefCross Ref
  7. Said Pertuz, Domenec Puig, and Miguel Angel Garcia. 2013. Analysis of Focus Measure Operators for Shape-from-focus. Pattern Recognition 46, 5 (May 2013), 1415--1432. https://doi.org/10.1016/j.patcog.2012.11.011 Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. M. Subbarao and J. K. Tyan. 1998. Selecting the optimal focus measure for autofocusing and depth-from-focus. IEEE Transactions on Pattern Analysis and Machine Intelligence 20, 8 (Aug 1998), 864--870. https://doi.org/10.1109/34.709612 Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. D. Vaquero, N. Gelfand, M. Tico, K. Pulli, and M. Turk. 2011. Generalized autofocus. In 2011 IEEE Workshop on Applications of Computer Vision (WACV). 511--518. https://doi.org/10.1109/WACV.2011.5711547 Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Zoran Zivkovic. 2004. Improved Adaptive Gaussian Mixture Model for Background Subtraction. In Proceedings of the 17th International Conference on Pattern Recognition (ICPR'04) (ICPR '04). IEEE Computer Society, Washington, DC, USA, 28--31. https://doi.org/10.1109/ICPR.2004.479 Google ScholarGoogle ScholarCross RefCross Ref
  11. Zoran Zivkovic and Ferdinand van der Heijden. 2006. Efficient Adaptive Density Estimation Per Image Pixel for the Task of Background Subtraction. Pattern Recognition Letters 27, 7 (May 2006), 773--780. https://doi.org/10.1016/j.patrec.2005.11.005 Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Mass Digitization of Archival Documents using Mobile Phones

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Other conferences
            HIP '17: Proceedings of the 4th International Workshop on Historical Document Imaging and Processing
            November 2017
            129 pages
            ISBN:9781450353908
            DOI:10.1145/3151509

            Copyright © 2017 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 10 November 2017

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article
            • Research
            • Refereed limited

            Acceptance Rates

            HIP '17 Paper Acceptance Rate19of33submissions,58%Overall Acceptance Rate52of90submissions,58%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader