Extraction of transliteration pairs from parallel corpora using a statistical transliteration model

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
396458	666468	2006	24 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Parallel corpora Statistical learning - یادگیری آماری

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش صفحه اول مقاله

Extraction of transliteration pairs from parallel corpora using a statistical transliteration model

چکیده انگلیسی

This paper describes a framework for modeling the machine transliteration problem. The parameters of the proposed model are automatically acquired through statistical learning from a bilingual proper name list. Unlike previous approaches, the model does not involve the use of either a pronunciation dictionary for converting source words into phonetic symbols or manually assigned phonetic similarity scores between source and target words. We also report how the model is applied to extract proper names and corresponding transliterations from parallel corpora. Experimental results show that the average rates of word and character precision are 93.8% and 97.8%, respectively.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Sciences - Volume 176, Issue 1, 6 January 2006, Pages 67–90

نویسندگان

Chun-Jen Lee, Jason S. Chang, Jyh-Shing Roger Jang,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Extraction of transliteration pairs from parallel corpora using a statistical transliteration model

دسترسی سریع

ارتباط

English Website