The table recognition literature contains many strategies specified informally as a sequence of operations, which obscures both the underlying models of table structure and the effects of individual decisions. Decision making is more transparent in formal model-based approaches (e.g., grammar-based ones), but these approaches are less flexible than informal ones. We propose an intermediate level of formalization, defining strategies as a sequence of basic graph transformations that correspond to recognition operations (e.g., classification, segmentation). Transformations are parameterized by logical types and decision functions, which together define structure models and executable strategies for interpreting input graphs. We provide an overview of our first attempt at this intermediate level of formalization, the Recognition Strategy Language (RSL). As a proof of concept, we reimplement two informally specified table recognition strategies from the literature in RSL. The RSL implementations make the formerly implicit table structure models explicit and automatically record all decision making.
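The idea of a strategy as a sequence of transformations, each parameterized by logical types and a decision function, can be illustrated with a minimal Python sketch. This is not RSL syntax or the authors' implementation: the names `Region`, `Interpretation`, `classify`, `segment`, and `run_strategy` are hypothetical, and a flat list of regions stands in for the input graph. The sketch only assumes what the abstract states: operations are applied in order, and every decision is recorded.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Set, Tuple


@dataclass
class Region:
    name: str
    text: str
    types: Set[str] = field(default_factory=set)


@dataclass
class Interpretation:
    regions: List[Region]
    # Each entry: (operation, region name, decision). Hypothetical log format.
    log: List[Tuple[str, str, str]] = field(default_factory=list)


def classify(interp, logical_types, decide):
    """Label each region with the type chosen by `decide`, if it is allowed."""
    for region in interp.regions:
        label = decide(region)
        if label in logical_types:
            region.types.add(label)
            interp.log.append(("classify", region.name, label))
    return interp


def segment(interp, logical_type, decide):
    """Merge runs of adjacent regions for which `decide(a, b)` is True."""
    merged: List[Region] = []
    for region in interp.regions:
        if merged and decide(merged[-1], region):
            prev = merged.pop()
            region = Region(
                prev.name + "+" + region.name,
                prev.text + " " + region.text,
                prev.types | region.types | {logical_type},
            )
            interp.log.append(("segment", region.name, logical_type))
        merged.append(region)
    interp.regions = merged
    return interp


def run_strategy(interp, strategy):
    """Apply each (operation, logical types, decision function) step in order."""
    for operation, types, decide in strategy:
        interp = operation(interp, types, decide)
    return interp


# A toy strategy: label words, then group neighbours with the same label.
strategy = [
    (classify, {"Header", "Entry"},
     lambda r: "Entry" if r.text.isdigit() else "Header"),
    (segment, "Row",
     lambda a, b: ("Header" in a.types) == ("Header" in b.types)),
]

words = [Region("w1", "Year"), Region("w2", "Total"),
         Region("w3", "2005"), Region("w4", "569")]
result = run_strategy(Interpretation(words), strategy)
```

In this sketch the logical types passed to each step play the role of the structure model, and `result.log` holds one entry per decision, which is the kind of trace the abstract describes as being captured automatically.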
Department, Program, or Center
Computer Science (GCCIS)
"The Recognition Strategy Language," Eighth International Conference on Document Analysis and Recognition. Held at Seoul, South Korea: 29 August - 1 September 2005. pps. 565 - 569 issn: 0-7695-2420-6
RIT – Main Campus