کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
536649 870591 2008 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Skew detection for complex document images using robust borderlines in both text and non-text regions
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
پیش نمایش صفحه اول مقاله
Skew detection for complex document images using robust borderlines in both text and non-text regions
چکیده انگلیسی

A new skew detection method for complex document images based on robust borderlines extracted from both text and non-text regions is proposed in this paper. First, borderlines are extracted from the borders of large connected components in a document image by using a run length based method. Second, after filtering out non-linear borderlines, a fast iteration algorithm is applied to optimize each linear borderline’s directional angle. Finally, the weighted median value of all the directional angles is calculated as the skew angle of the whole document. Experiments on 2000 various skew document images are implemented. Total correct rate is 95.2%, and the detecting time on average is less than 0.2 s for each document. The proposed skew detection method is efficient for complex documents with horizontal and vertical text layout, three kinds of linguistic characters in English, Japanese and Chinese, especially for documents with predominant non-text regions or sparse text regions.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Pattern Recognition Letters - Volume 29, Issue 13, 1 October 2008, Pages 1893–1900
نویسندگان
, , , ,