Abstract.
Although the Internet is increasingly emerging as “the” widespread platform for information interchange, day-to-day work in companies still necessitates the laborious, manual processing of huge amounts of printed documents. This article presents the system smartFIX, a document analysis and understanding system developed by the DFKI spin-off insiders technologies. It enables the automatic processing of documents ranging from fixed format forms to unstructured letters of any format. In addition to the architecture, main components, and system characteristics, we also show some results from the application of smartFIX to medical bills and prescriptions.
Similar content being viewed by others
References
Altenhofen C, Stanišic-Petrovic M, Junker M, Kieninger T, Hofmann H (2002) Werkzeugeinsatz in der Dokumentenverwaltung (German). In: Computerworld Schweiz, Nr. 15/2002, S. 6-11. http://www.kodok.de/german/literat/artikel/ index\_artikel.html
Baumann S, Ben Hadj Ali M, Dengel A, Jäger T, Malburg M, Weigel A, Wenzel C (1997) Message extraction from printed documents a complete solution. In: Proceedings of the 4th international conference on document analysis and recognition (ICDAR), Ulm, Germany
Dengel A, Dubiel F (1996) Computer understanding of document structure. Int J Imag Sys Technol 7(4):271-278
Dengel A, Bleisinger R, Hoch R, Hönes F, Malburg M, Fein F (1994) OfficeMAID -- a system for automatic mail analysis, interpretation and delivery. In: Proceedings of DAS94, International Association for Pattern Recognition workshop on document analysis systems, Kaiserslautern, Germany, October 1994, pp 253-276
Dengel A, Hinkelmann K (1996) The Specialist Board - a technology workbench for document analysis and understanding. In: Tanik MM, Bastani FB, Gibson D, Fielding PJ (eds) Integrated design and process technology - IDPT96. Proceedings of the 2nd world conference, Austin, TX
Dubiel F, Dengel A (1998) FormClas -- OCR-free classification of forms. In: Hull JJ, Liebowitz S (eds) Document analysis systems II. World Scientific, Singapore, pp 189-208
Fordan A (2001) Constraint solving over OCR graphs. In: Proceedings of the 14th international conference on applications of Prolog (INAP), Tokyo, Japan
Junker M, Dengel A (2001) Preventing overfitting in learning text patterns for document categorization. In: Proceedings of the 2nd international conference on advances in pattern recognition (ICAPR2001), Rio de Janeiro, March 2001
Kieninger T, Dengel A (1998) A paper-to-HTML table converting system. In: Proceedings of DAS98, International Association for Pattern Recognition workshop on document analysis systems, Nagano, Japan, November 1998, pp 356-365
Klein B, Gökkus S, Kieninger T, Dengel A (2001) Three approaches to “industrial” table spotting. In: Proceedings of the 6th international conference on document analysis and recognition (ICDAR), Seattle
Schreiber G, Akkermans H, Anjewierden A, de Hoog R, Shadbolt N, van de Velde W, Wielinga B (1999) Knowledge engineering and management - the CommonKADS methodology. MIT Press, Cambridge, MA
Author information
Authors and Affiliations
Corresponding author
Additional information
Received: 26 June 2003, Accepted: 17 February 2004, Published online: 16 March 2004
Correspondence to: Bertin Klein
Extension of the version published in Lectures Notes in Computer Science (LNCS), vol. 2423, Springer, Heidelberg, 2002
Rights and permissions
About this article
Cite this article
Klein, B., Dengel, A.R. Problem-adaptable document analysis and understanding for high-volume applications. IJDAR 6, 167–180 (2003). https://doi.org/10.1007/s10032-004-0122-7
Issue Date:
DOI: https://doi.org/10.1007/s10032-004-0122-7