Article code: 396458
Journal code: 666468
Publication year: 2006
English article: 24-page PDF
Full-text version: Free download
English title of the ISI article
Extraction of transliteration pairs from parallel corpora using a statistical transliteration model
Related topics
Engineering and Basic Sciences, Computer Engineering, Artificial Intelligence
English abstract

This paper describes a framework for modeling the machine transliteration problem. The parameters of the proposed model are automatically acquired through statistical learning from a bilingual proper name list. Unlike previous approaches, the model does not involve the use of either a pronunciation dictionary for converting source words into phonetic symbols or manually assigned phonetic similarity scores between source and target words. We also report how the model is applied to extract proper names and corresponding transliterations from parallel corpora. Experimental results show that the average rates of word and character precision are 93.8% and 97.8%, respectively.

Publisher
Database: Elsevier - ScienceDirect
Journal: Information Sciences - Volume 176, Issue 1, 6 January 2006, Pages 67–90
Authors
, , ,