کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
531244 869820 2011 13 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A spatially adaptive statistical method for the binarization of historical manuscripts and degraded document images
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
پیش نمایش صفحه اول مقاله
A spatially adaptive statistical method for the binarization of historical manuscripts and degraded document images
چکیده انگلیسی

In this paper, we present an adaptive method for the binarization of historical manuscripts and degraded document images. The proposed approach is based on maximum likelihood (ML) classification and uses a priori information and the spatial relationship on the image domain. In contrast with many conventional methods that use a decision based on thresholding, the proposed method performs a soft decision based on a probabilistic model. The main idea is that, from an initialization map (under-binarization) containing only the darkest part of the text, the method is able to recover the main text in the document image, including low-intensity and weak strokes. To do so, fast and robust local estimation of text and background features is obtained using grid-based modeling and inpainting techniques; then, the ML classification is performed to classify pixels into black and white classes. The advantage of the proposed method is that it preserves weak connections and provides smooth and continuous strokes, thanks to its correlation-based nature. Performance is evaluated both subjectively and objectively against standard databases. The proposed method outperforms the state-of-the-art methods presented in the DIBCO’09 binarization contest, although those other methods provide performance close to it.

Research highlights
► A new approach to the binarization of degraded document images is introduced.
► It spatially adapts a two-class maximum likelihood (ML) classifier to the pixels.
► The parameters of a class are computed locally from the gray-level distribution.
► Locally ML classification is performed in order to separate pixels in two classes.
► The proposed algorithm has been tested a the standard dataset.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Pattern Recognition - Volume 44, Issue 9, September 2011, Pages 2184–2196
نویسندگان
, , ,