Document Layout Analysis is a part of Computer Vision indicating the process of identifying and categorizing the regions of interest in a document image, e.g. a scanned page. A reading system requires the segmentation of text zones from non-textual ones and the arrangement in their correct reading order.[1] Detection and labeling of the different zones (or blocks) as text body, illustrations, math symbols, and tables embedded in a document is called geometric layout analysis. But text zones play different logical roles inside the document (titles, captions, footnotes, etc.) and this kind of semantic labeling is the scope of the logical layout analysis.
Document layout analysis is the union of geometric and logical labeling. It is typically performed before a document image is sent to an OCR engine, but it can be used also to detect duplicate copies of the same document in large archives, or to index documents by their structure or pictorial content.
Document layout is formally defined in the international standard ISO 8613-1:1989.[2]
[edit] Layout Analysis Software
[edit] See also
[edit] External links
- ^ H.S. Baird. "Anatomy of a versatile page reader". Proc. of IEEE, 80(7):1056-1065, 1992
- ^ ISO 8617 "Information processing -- Text and office systems -- Office Document Architecture (ODA) and interchange format", International Organization for Standardization, 1989