Problem-adaptable document analysis and understanding for high-volume applications

Document Analysis and Recognition

Abstract.

Although the Internet is increasingly emerging as “the” widespread platform for information interchange, day-to-day work in companies still requires the laborious manual processing of huge amounts of printed documents. This article presents smartFIX, a document analysis and understanding system developed by the DFKI spin-off insiders technologies. It enables the automatic processing of documents ranging from fixed-format forms to unstructured letters of any format. In addition to the architecture, main components, and system characteristics, we also show some results from the application of smartFIX to medical bills and prescriptions.


Author information

Corresponding author

Correspondence to Bertin Klein.

Additional information

Received: 26 June 2003, Accepted: 17 February 2004, Published online: 16 March 2004

Extension of the version published in Lecture Notes in Computer Science (LNCS), vol. 2423, Springer, Heidelberg, 2002

About this article

Cite this article

Klein, B., Dengel, A.R. Problem-adaptable document analysis and understanding for high-volume applications. IJDAR 6, 167–180 (2003). https://doi.org/10.1007/s10032-004-0122-7

  • DOI: https://doi.org/10.1007/s10032-004-0122-7
