Article ID: 1038788
Journal: Journal of Cultural Heritage
Published Year: 2008
Pages: 8
File Type: PDF
Abstract

Before a document image can be processed further, its ink pixels must be separated from the background pixels. This paper presents a new method for thresholding images of historical documents. The main objective is to produce high-quality monochromatic images at low processing time, which makes the contents of the image files easier to access. An important problem arises when the document is written on both sides of the paper. If the separation between the ink and the background is not correctly defined, the thresholding process can lose the contents of the document completely. We present a new, efficient algorithm for the binarization of historical documents and analyze its performance by comparing it with nineteen other classic thresholding algorithms, using measures such as precision, recall, accuracy, specificity, and a fidelity index. Our method achieved better results than the other well-known algorithms.
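
As an illustration of the evaluation measures named in the abstract, the following minimal sketch computes precision, recall, accuracy, and specificity for a binarized image against a ground-truth image. It is not the paper's implementation: the function name `binarization_metrics`, the nested-list image representation (1 = ink, 0 = background), and the toy images are assumptions made for this example, and the fidelity index is omitted because the abstract does not define it.

```python
def binarization_metrics(predicted, ground_truth):
    """Compare two binary images given as nested lists (1 = ink, 0 = background).

    Hypothetical helper for illustration; ink is treated as the positive class.
    """
    tp = fp = tn = fn = 0
    for pred_row, true_row in zip(predicted, ground_truth):
        for p, t in zip(pred_row, true_row):
            if p and t:
                tp += 1      # ink correctly kept as ink
            elif p and not t:
                fp += 1      # background wrongly marked as ink
            elif not p and t:
                fn += 1      # ink wrongly erased
            else:
                tn += 1      # background correctly kept as background

    def ratio(numerator, denominator):
        # Return 0.0 when a measure is undefined (empty denominator).
        return numerator / denominator if denominator else 0.0

    return {
        "precision": ratio(tp, tp + fp),
        "recall": ratio(tp, tp + fn),
        "accuracy": ratio(tp + tn, tp + fp + tn + fn),
        "specificity": ratio(tn, tn + fp),
    }


# Toy 2x4 images, invented for this example.
ground_truth = [[1, 1, 0, 0],
                [0, 1, 0, 0]]
predicted = [[1, 0, 0, 1],
             [0, 1, 0, 0]]

metrics = binarization_metrics(predicted, ground_truth)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Treating ink as the positive class means recall reflects how much of the writing survives thresholding, while specificity reflects how much of the background is kept clean.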

Related Topics
Physical Sciences and Engineering > Chemistry > Physical and Theoretical Chemistry