Article ID Journal Published Year Pages File Type
526811 Image and Vision Computing 2015 19 Pages PDF
Abstract

•Background removal technique based on adaptive median filtering and thresholding•A Local Co-occurrence Map with local contrast can distinguish between document text and document stains and background.•Low complexity approach with fast and accurate binarization results

In this paper, we address the document image binarization problem with a three-stage procedure. First, possible stains and general document background information are removed from the image through a background removal stage. The remaining misclassified background and character pixels are then separated using a Local Co-occurrence Mapping, local contrast and a two-state Gaussian Mixture Model. Finally, some isolated misclassified components are removed by a morphology operator. The proposed scheme offers robust and fast performance, especially for both handwritten and printed documents, which compares favorably with other binarization methods.

Related Topics
Physical Sciences and Engineering Computer Science Computer Vision and Pattern Recognition
Authors
, ,