Olena  User documentation 2.1
An Image Processing Platform
Data structures

Classes

class  scribo::component_set< L >
 Represents all the components in a document image.
class  scribo::document< L >
 Represents document data and structure.
class  scribo::line_links< L >
 Line links representation.
class  scribo::line_set< L >
 Lines container.
class  scribo::object_groups< L >
 Object group representation.
class  scribo::object_links< L >
 Object links representation.
class  scribo::paragraph_set< L >
 Paragraph container.

Detailed Description

Scribo high-level data structures.

Scribo currently provides bottom-up approaches to extracting data from a document: starting from connected components, successive algorithms reconstruct lines and then paragraphs. This implies the following workflow (see figure below).

At each level, the current results can usually be filtered using routines in namespace scribo::filter, so that false positives can be invalidated.
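As an illustration of what "invalidating" a false positive can mean, here is a minimal, self-contained sketch. It assumes that a rejected element is flagged rather than erased, so that indices held by later structures stay meaningful; that reading is an assumption, not something this page states. The names toy_component and invalidate_small are hypothetical and are not part of scribo::filter or of any Scribo class.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for one element of a component container.
// This is NOT the scribo::component_set API.
struct toy_component {
  int width;
  int height;
  bool valid;  // set to false once a filter has rejected the component
};

// Flags every still-valid component whose width or height is below
// min_side. Nothing is erased, so the container size and the element
// indices are unchanged. Returns how many components this call flagged.
inline std::size_t invalidate_small(std::vector<toy_component>& comps,
                                    int min_side) {
  assert(min_side >= 0);
  std::size_t flagged = 0;
  for (toy_component& c : comps) {
    if (c.valid && (c.width < min_side || c.height < min_side)) {
      c.valid = false;
      ++flagged;
    }
  }
  return flagged;
}
```

Because nothing is removed, running the same filter a second time flags no further elements, and a structure built earlier from index i still refers to the same component.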

Data structures at a given step usually keep track of the parent structures they were built from. For instance, object_links keeps a reference to the component_set it is based on.
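The parent-tracking idea can be sketched with two small stand-in types. This is only an illustration of the pattern under stated assumptions: toy_components and toy_links are hypothetical names, the use of std::shared_ptr for the back-reference is a choice made for this sketch, and none of it reflects the actual members of scribo::object_links or scribo::component_set.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <utility>
#include <vector>

// Hypothetical parent structure: one attribute (an area) per component.
struct toy_components {
  std::vector<int> areas;
};

// Hypothetical child structure: stores, for each component, the index of
// the component it is linked to, and keeps a handle on its parent.
class toy_links {
public:
  // Precondition: comps is non-null. Each component starts linked to itself.
  explicit toy_links(std::shared_ptr<const toy_components> comps)
    : comps_(std::move(comps)) {
    assert(comps_ != nullptr);
    link_.resize(comps_->areas.size());
    for (std::size_t i = 0; i < link_.size(); ++i)
      link_[i] = i;
  }

  // Access to the parent structure these links were built from.
  const toy_components& components() const { return *comps_; }

  // Links component `from` to component `to`.
  void link(std::size_t from, std::size_t to) {
    assert(from < link_.size() && to < link_.size());
    link_[from] = to;
  }

  std::size_t target(std::size_t i) const {
    assert(i < link_.size());
    return link_[i];
  }

  std::size_t size() const { return link_.size(); }

private:
  std::shared_ptr<const toy_components> comps_;  // back-reference to parent
  std::vector<std::size_t> link_;
};
```

With such a layout, code that only receives the links can still reach per-component attributes through components(), without the caller having to pass the parent container separately.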

These structures are iterable and usually store several attributes for their corresponding elements.


Figure (comp2doc_structs.png): Data structure construction workflow and useful routines/namespaces for text extraction.