|
Published Articles >> Table of Contents >> Abstract
November 1993 (Vol. 15, No. 11)
pp. 1162-1173
The Document Spectrum for Page Layout Analysis
L. O'Gorman
Full Article Text:
 
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/34.244677
Send link to a friend
| Abstract |
|
Page layout analysis is a document processing technique used to determine the format of a page. This paper describes the document spectrum (or docstrum), which is a method for structural page layout analysis based on bottom-up, nearest-neighbor clustering of page components. The method yields an accurate measure of skew, within-line, and between-line spacings and locates text lines and text blocks. It is advantageous over many other methods in three main ways: independence from skew angle, independence from different text spacings, and the ability to process local regions of different text orientations within the same image. Results of the method shown for several different page formats and for randomly oriented subpages on the same image illustrate the versatility of the method. We also discuss the differences, advantages, and disadvantages of the docstrum with respect to other lay-out methods.
|
References
|
[1] W. Postl, "Detection of linear oblique structures and skew scan in digitized documents," inProc. 8th Int. Conf. Patt. Recogn. (ICPR)(Paris), Oct. 1986, pp. 687-689.
[2] H. S. Baird, "The skew angle of printed documents," inProc. Conf. Soc. Photog. Scien. Eng.(Rochester, NY), May 1987, pp. 14-21.
[3] T. Akiyama and N. Hagita, "Automated entry system for printed documents,"Pattern Recogn., vol. 23, no. 11, pp. 1141-1153, 1990.
[4] T. Pavlidis and J. Zhou, "Page segmentation by white streams," inProc. First Int. Conf. Document Anal. Recogn. (ICDAR)(St. Malo, France), Sept. 1991, pp. 945-953.
[5] S. N. Srihari and V. Govindaraju, "Analysis of textual images using the Hough transform,"Machine Vision Applications, vol. 2, pp. 141-153, 1989.
[6] S. C. Hinds, J. L. Fisher, and D. P. D'Amato, "A document skew detection method using run-length encoding and the Hough transform," inProc. 10th Int. Conf. Pattern Recogn., 1990, pp. 464-468.
[7] A. Hashizume, P. -S. Yeh, and A. Rosenfeld, "A method of detecting the orientation of aligned components,"Patt. Recogn. Lett., vol. 4, pp. 125-132, 1986.
[8] K. Y. Wong, R. G. Casey, and F. M. Wahl, "Document analysis system,"IBM J. Res. Development, vol. 6, pp. 642-656, Nov. 1982.
[9] G. Nagy and S. Seth, "Hierarchical representation of optically scanned documents," inProc. 7th Int. Conf. Patt. Recogn. (ICPR)(Montreal, Canada), 1984, pp. 347-349.
[10] G. Nagy, S. Seth, and M. Viswanathan, "A prototype document image analysis system for technical journals,"IEEE Comput., Special issue on Document Image Analysis Systems, pp. 10-22, July 1992.
[11] H. S. Baird, S. E. Jones, and S. J. Fortune, "Image segmentation using shape-directed covers," inProc. 10th Int. Conf. Patt. Recogn. (ICPR)(Atlantic City, NJ), June 1990, pp. 820-825.
[12] J. L. Fisher, S. C. Hinds, and D. P. D'Amato. "A rule-based system for document image segmentation," inProc. 10th Int. Conf. Pattern Recogn., 1990, pp. 567-572.
[13] F. Esposito, D. Malerba, G. Semeraro, E. Annese, and G. Scafuro, "An experimental page layout recognition system for office document automatic classification: An integrated approach for inductive generalization," inProc. 10th IEEE Int. Conf. Patt. Recogn.(Atlantic City, NJ), 1990, pp. 557-562.
[14] R. O. Duda and P. E. Hart,Pattern Classification and Scene Analysis. New York: Wiley, 1973.
[15] L. O'Gorman, "Image and Document Processing Techniques for the Right Pages Electronic Library System,"Proc. 11th IAPR Int'l Conf. Pattern Recognition, Vol. II, IEEE CS Press, Los Alamitos, Calif., Order No. 2915, 1992, pp. 260-263.
[16] L. O'Gorman, "Primitives chain code", inProgress in Computer Vision and Image Processing(A. Rosenfeld and L. G. Shapiro, Eds.). San Diego: Academic, 1992, pp. 167-183
[17] H. V. Jagadish and L. O'Gorman, "An object model for image recognition,"IEEE Comput., vol. 22, no. 12, pp. 33-41, Dec. 1989.
[18] L. O'Gorman and G. I. Weil, "An approach toward segmenting contour line regions," inProc. 8th Int. Conf. Patt. Recogn.(Paris), Oct. 1986, pp. 254-258.
[19] M. Seul, L. R. Monar, L. O'Gorman, and R. Wolfe, "Morphology and local structure in labyrinthine stripe domain phases,"Sci., vol. 254, Dec. 13, 1991, pp. 1616-1618.
[20] G. A. Story, L. O'Gorman, D. Fox, L. Schaper, and H. V. Jagadish, "The RightPages: An electronic library for alerting and browsing,"IEEE Comput., pp. 17-26, 1992.
|
Additional Information
|
Index Terms- document spectrum; nearest-neighbor clustering; document image processing; docstrum; structural page layout analysis; bottom-up method; skew; within-line spacings; between-line spacings; text spacings; document image processing; image segmentation
Citation:
L. O'Gorman,
"The Document Spectrum for Page Layout Analysis,"
IEEE Transactions on Pattern Analysis and Machine Intelligence,
vol. 15,
no. 11,
pp. 1162-1173,
Nov.,
1993
|
|