Document Layout Analysis

Document Layout Analysis

Document Layout Analysis is a part of Computer Vision indicating the process of identifying and categorizing the regions of interest in a document image, e.g. a scanned page. A reading system requires the segmentation of text zones from non-textual ones and the arrangement in their correct reading order.[1] Detection and labeling of the different zones (or blocks) as text body, pictures, math symbols, and tables embedded in a document is called geometric layout analysis. But text zones play different logical roles inside the document (titles, captions, footnotes, etc.) and this kind of semantic labeling is the scope of the logical layout analysis.

Document layout analysis is the union of geometric and logical labeling. It is typically performed before a document image is sent to an OCR engine, but it can be used also to detect duplicate copies of the same document in large archives, or to index documents by their structure or pictorial content.

Document layout is formally defined in the international standard ISO 8613-1:1989.[2]

Contents

Layout Analysis Software

See also

External links

Notes

  1. ^ H.S. Baird. "Anatomy of a versatile page reader". Proc. of IEEE, 80(7):1056-1065, 1992
  2. ^ ISO 8617 "Information processing -- Text and office systems -- Office Document Architecture (ODA) and interchange format", International Organization for Standardization, 1989

Wikimedia Foundation. 2010.

Игры ⚽ Нужна курсовая?

Look at other dictionaries:

  • Document — For the R.E.M. album, see Document (album). For the similarly named surrealist journal, see Documents (magazine). The term document has more meanings in ordinary language and in scholarship. WordNet 3.1. lists four meanings (October 2011):… …   Wikipedia

  • Integrated circuit layout design protection — Layout designs (topographies) of integrated circuits are a field in the protection of intellectual property. Like most of the other forms of intellectual property, IC layout designs are creations of the human mind. They are usually the result of… …   Wikipedia

  • OCRopus — Developer(s) Thomas Breuel, DFKI Initial release 9 April 2007[1] Preview release 0.4.4 (alpha) / May 1, 2010; 18 months ago (2010 05 01 …   Wikipedia

  • Tesseract (software) — Infobox Software name = Tesseract caption = author = Ray Smith, Hewlett Packard cite web|url = http://code.google.com/p/tesseract ocr/|title = tesseract ocr|accessdate = 2008 07 12|last = Google|authorlink = |year = 2008] developer = Google… …   Wikipedia

  • OCRFeeder — Developer(s) Joaquim Rocha (Igalia) …   Wikipedia

  • HOCR (software) — Infobox Software name = HOCR caption = author = Yaacov Zamir developer = released = latest release version = latest release date = latest preview version = latest preview date = programming language = C, Python and C++ operating system = Linux… …   Wikipedia

  • Integrated circuit design — Layout view of a simple CMOS Operational Amplifier ( inputs are to the left and the compensation capacitor is to the right ). The metal layers are colored blue and green, the polysilicon is red and vias are crosses. Integrated circuit design, or… …   Wikipedia

  • Microsoft Office 2007 — applications shown on Windows Vista (clockwise from top left: Excel, Word, OneNote, PowerPoint …   Wikipedia

  • List of file formats — This is an incomplete list, which may never be able to satisfy particular standards for completeness. You can help by expanding it with reliably sourced entries. See also: List of file formats (alphabetical) This is a list of file formats… …   Wikipedia

  • Wikipedia:Featured article candidates — Here, we determine which articles are to be featured articles (FAs). FAs exemplify Wikipedia s very best work and satisfy the FA criteria. All editors are welcome to review nominations; please see the review FAQ. Before nominating an article,… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”