Document recognition involves many kinds of hypotheses: segmentation hypotheses, classification hypotheses, spatial relationship hypotheses, and so on. Many recognition strategies generate valid hypotheses which are eventually rejected, but current evaluation methods consider only accepted hypotheses. As a result, we have no way to measure errors associated with rejecting valid hypotheses. We propose describing hypothesis generation in more detail, by collecting the complete set of generated hypotheses and computing the recall and precision of this set: we call these the ‘historical recall’ and ‘historical precision.’ Using table cell detection examples, we demonstrate how historical recall and precision along with the complete set of generated hypotheses assist in the evaluation, debugging, and design of recognition strategies.
Date of creation, presentation, or exhibit
Department, Program, or Center
Computer Science (GCCIS)
"Historical Recall and Precision: Summarizing Generated Hypotheses," Eighth International Conference on Document Analysis and Recognition. Held in Seoul, South Korea: 29 August - 1 September 2005 ©2005 IEEE. pps. 202-206 isbn: 0-7695-2420-6
RIT – Main Campus