کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
536159 870473 2016 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Baseline detection of multi-lingual unconstrained handwritten text lines
ترجمه فارسی عنوان
تشخیص پایه خطوط متن دست خط چندزبانه بدون محدودیت
کلمات کلیدی
تشخیص دست خط، تشخیص پایه، تجزیه و تحلیل سند چند زبانه، پردازش متن چند منظوره
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
چکیده انگلیسی


• A novel method for baseline detection of multi-lingual multi-oriented text lines.
• To our knowledge, this is the first baseline detection method for multi-turn text lines.
• The method uses machine learning along with rotation invariant features for constructing the baseline.
• The method improves the performance of the state-of-the-art character segmentation method substantially.

Many handwritten text recognition systems use the baseline information for better recognition of text line characters. Improper baseline detection reduces the performance of the recognition. In this paper we propose a novel baseline detection scheme for unconstrained handwritten text lines of multilingual documents. For baseline detection of a text line, at first, we detect the set of significant contour points (S-points) of the text line. Every non-singleton subsets of S-points forms a curve. The orientation invariant features of the curve determine whether the curve can construct a probable baseline of the input text line or not. It is determined by an SVM, trained using the orientation invariant features of the curves. The curves classified as probable baselines, are sorted according to their relative positions in ascending order to get the optimal baseline. We tested our method on different handwritten text lines of Bangla(Bengali), English(Roman), Kannada, Oriya, Devnagari and Persian scripts and obtained encouraging results.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Pattern Recognition Letters - Volume 74, 15 April 2016, Pages 74–81
نویسندگان
, ,