Abstract
                                                                        A document image is composed of a variety of physical entities or regions such as text  blocks, lines, words, figures, tables, and background. We could also assign functional or  logical labels such as sentences, titles, captions, author names, and addresses to some of  these regions. The process of document structure and layout analysis tries to decompose  a given document image into its component regions and understand their functional  roles and relationships. The processing is carried out in multiple steps, such as preprocessing, page decomposition, structure understanding, etc. We will look into each of  these steps in detail in the following sections.  Document images are often generated from ph