Article code: 1110926
Journal code: 1488361
Publication year: 2015
English article: 5-page PDF
Full-text version: free download
English title of the ISI article
A Structure for Annotation and Ground-truthing of Urdu Handwritten Text Image Corpus
Related topics
Social Sciences and Humanities, Arts and Humanities, Arts and Humanities (General)
English abstract

Over the last few decades, handwriting recognition has advanced considerably. Handwritten material is becoming scarcer as digital electronics spread. However, research on a particular language requires a large database of handwritten documents. In this paper we describe our approach to developing a large corpus of Urdu handwritten text images. To make the database usable across the broad field of Natural Language Processing, we annotate each image and associate XML-based ground-truth meta-information with it, making it machine-readable as a linguistic resource. This paper focuses on issues related to corpus design and annotation, such as data collection, writer selection, and annotation methodology.
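
The abstract mentions attaching XML-based ground-truth meta-information to each image but does not give the schema. As a rough illustration only, the sketch below builds one such record in Python. The element and attribute names (page, writer, transcription, line, image, n), the file name, and the writer identifier are all hypothetical and are not taken from the paper.

```python
import xml.etree.ElementTree as ET


def build_ground_truth(image_file, writer_id, text_lines):
    """Return an XML string describing one handwritten page image.

    The schema here is an assumption made for illustration; the paper's
    actual ground-truth format is not shown in the abstract.
    """
    # Root element ties the record to a single image file.
    root = ET.Element("page", attrib={"image": image_file, "language": "ur"})

    # Hypothetical writer metadata (the abstract mentions writer selection).
    writer = ET.SubElement(root, "writer")
    writer.set("id", writer_id)

    # One <line> element per transcribed text line, numbered from 1.
    transcription = ET.SubElement(root, "transcription")
    for number, text in enumerate(text_lines, start=1):
        line = ET.SubElement(transcription, "line", attrib={"n": str(number)})
        line.text = text

    # encoding="unicode" returns a str, so Urdu text stays readable.
    return ET.tostring(root, encoding="unicode")


record = build_ground_truth(
    "page_0001.png",  # hypothetical file name
    "W001",           # hypothetical writer identifier
    ["یہ پہلی سطر ہے", "یہ دوسری سطر ہے"],
)
parsed = ET.fromstring(record)
print(record)
```

Keeping the transcription line by line, next to the image reference, is one way such a record could serve both recognition experiments and linguistic queries; the real corpus may organize this differently.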

Publisher
Database: Elsevier - ScienceDirect
Journal: Procedia - Social and Behavioral Sciences - Volume 198, 24 July 2015, Pages 84-88