Article ID Journal Published Year Pages File Type
534771 Pattern Recognition Letters 2012 9 Pages PDF
Abstract

This paper presents a novel local threshold algorithm for the binarization of document images. Stroke width of handwritten and printed characters in documents is utilized as the shape feature. As a result, in addition to the intensity analysis, the proposed algorithm introduces the stroke width as shape information into local thresholding. Experimental results for both synthetic and practical document images show that the proposed local threshold algorithm is superior in terms of segmentation quality to the threshold approaches that solely use intensity information.

► This paper presents a novel local threshold algorithm for the binarization of document images. ► Stroke width of handwritten and printed characters in documents is utilized as the shape feature. ► The shape of stroke width is captured by forming a histogram of distance transform. ► A recursive image domain sub-division algorithm is designed with shape histogram. ► Proposed method yields competitive results on DIBCO2009 dataset.

Keywords
Related Topics
Physical Sciences and Engineering Computer Science Computer Vision and Pattern Recognition
Authors
, , ,