$extrastylesheet
Classes | |
class | scribo::component_set< L > |
Represents all the components in a document image. More... | |
class | scribo::document< L > |
Represent document data and structure. More... | |
class | scribo::line_links< L > |
Line links representation. More... | |
class | scribo::line_set< L > |
Lines container. More... | |
class | scribo::object_groups< L > |
Object group representation. More... | |
class | scribo::object_links< L > |
Object links representation. More... | |
class | scribo::paragraph_set< L > |
Paragraph container. More... |
Scribo high level data structures.
For the moment, Scribo provides bottom-up approaches to extract data from a document. From connected components, thanks to different algorithms, lines and paragraphs are reconstructed. It implies the following workflow (see figure below).
At each level, it is usually possible to "filter" the current results using routines in namespace scribo::filter. False positive can then be invalidated.
Data structures at a specific step, usually keep track of previous parent structures. For instance, object_links keeps a reference to the component_set it is based on.
These structures are iterable and usually store several attributes for their corresponding elements.