کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
485463 703327 2016 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A Study of Statistical Machine Translation Methods for Under Resourced Languages
ترجمه فارسی عنوان
مطالعه روشهای آماری ماشین آماری برای زبانهای تحت رشته
کلمات کلیدی
ترجمه ماشین تحت زبان منابع، مبتنی بر عبارت، بر اساس طبقه بندی سلسله مراتبی، مدل توالی عملیاتی، مبتنی بر نحو
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر علوم کامپیوتر (عمومی)
چکیده انگلیسی

This paper contributes an empirical study of the application of five state-of-the-art machine translation to the trans- lation of low-resource languages. The methods studied were phrase-based, hierarchical phrase-based, the operational sequence model, string-to-tree, tree-to-string statistical machine translation methods between English (en) and the under resourced languages Lao (la), Myanmar (mm), Thai (th) in both directions. The performance of the machine translation systems was automatically measured in terms of BLEU and RIBES for all experiments. Our main findings were that the phrase-based SMT method generally gave the highest BLEU scores. This was counter to expectations, and we believe indicates that this method may be more robust to limitations on the data set size. However, when evaluated with RIBES, the best scores came from methods other than phrase-based SMT, indicating that the other methods were able to handle the word re-ordering better even under the constraint of limited data. Our study achieved the highest reported results on the data sets for all translation language pairs.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Procedia Computer Science - Volume 81, 2016, Pages 250–257
نویسندگان
, , , ,