Students: Tessa Triolo and Dede Russell
Dr. Elisa Barney Smith & Dr. Tim Andersen
This research was funded by a grant from the Computing Research Association, Committee on the Status of Women in Computing Research’s CREU: Collaborative Research Experience for Undergraduates in Computer Science and Engineering project.
Often documents are poorly illuminated when they are scanned or have yellowed with aged causing an uneven background color.
to convert the image into a text document, the image is passed through an Optical Character Recognition (OCR) algorithm. Most OCR algorithms process only input images that are black and white, without intermediate gray levels. Therefore the image must be thresholded. The simplest thresholding algorithm is a global threshold. That doesn’t work well on images with varying background content.