The detection and extraction of text regions in an image is a well known problem in the computer vision research area. The goal of this project is to compare two basic approaches to text extraction in natural (non-document) images: edge-based and connected-component based. The algorithms are implemented and evaluated using a set of images of natural scenes that vary along the dimensions of lighting, scale and orientation. Accuracy, precision and recall rates for each approach are analyzed to determine the success and limitations of each approach. Recommendations for improvements are given based on the results.
Computer Science (MS)
Department, Program, or Center
Computer Science (GCCIS)
Sharma, Sneha, "Extraction of text regions in natural images" (2007). Thesis. Rochester Institute of Technology. Accessed from
RIT – Main Campus