Abstract
Patent document images maintained by the U.S. patent database have a specific format, in which figures and descriptions are separated into different pages. This makes it difficult for users to refer to a figure while reading the description or vice versa. The system introduced in this paper is to prepare these patent documents for a friendly browsing interface. The system is able to segment an imaged page with several figures into individual figures and extract caption and label information from the figure. After obtaining captions and labels, figures and the relevant description are linked together, and thus users could easily refer from a description to the figure or vice versa.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
The U.S. Patent Database, http://www.uspto.gov/patft/help/contents.htm
Nagy, G., Seth, S., Viswanathan, M.: A Prototype Document Image Analysis System for Technical Journals. Computer 25, 10–22 (1992)
Gorman, L.: The Document Spectrum for Page Layout Analysis. IEEE Trans. Pattern Analysis and Machine Intelligence 15, 1162–1173 (1993)
Kise, K., Sato, A., Iwata, M.: Segmentation of Page Images Using the Area Voronoi Diagram. Computer Vision and Image Understanding 70, 370–382 (1998)
Wong, K.Y., Casey, R.G., Wahl, F.M.: Document analysis system. IBM Journal of Research and Development, 647–656 (1982)
Gonzalez, R., Woods, R.: Digital Image Processing, ch. 2. Addison-Wesley Publishing Company, Reading (1992)
Yuan, B., Kwoh, L.K., Tan, C.L.: Finding the best-fit bounding-boxes. In: 7th IAPR Workshop on Document Analysis Systems, New Zealand (February 13-15, 2006)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Li, L., Lu, S., Tan, C.L. (2008). A Figure Image Processing System. In: Liu, W., Lladós, J., Ogier, JM. (eds) Graphics Recognition. Recent Advances and New Opportunities. GREC 2007. Lecture Notes in Computer Science, vol 5046. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88188-9_19
Download citation
DOI: https://doi.org/10.1007/978-3-540-88188-9_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-88184-1
Online ISBN: 978-3-540-88188-9
eBook Packages: Computer ScienceComputer Science (R0)