دانلود رایگان مقاله: بررسی منابع داده بزرگ برای آموزش چند زبانه و پورت سیستم

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
485430	703327	2016	8 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Study of Large Data Resources for Multilingual Training and System Porting

ترجمه فارسی عنوان

بررسی منابع داده بزرگ برای آموزش چند زبانه و پورت سیستم

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

بطری گردن انباشته شده استخراج ویژگی، آموزش چند زبانه، داده های بزرگ، پایگاه داده فیشر

Feature extraction - استخراج ویژگی Large data - داده های بزرگ

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر علوم کامپیوتر (عمومی)

پیش نمایش مقاله

بررسی منابع داده بزرگ برای آموزش چند زبانه و پورت سیستم

چکیده انگلیسی

This study investigates the behavior of a feature extraction neural network model trained on a large amount of single language data (“source language”) on a set of under-resourced target languages. The coverage of the source language acoustic space was changed in two ways: (1) by changing the amount of training data and (2) by altering the level of detail of acoustic units (by changing the triphone clustering). We observe the effect of these changes on the performance on target language in two scenarios: (1) the source-language NNs were used directly, (2) NNs were first ported to target language.The results show that increasing coverage as well as level of detail on the source language improves the target language system performance in both scenarios. For the first one, both source language characteristic have about the same effect. For the second scenario, the amount of data in source language is more important than the level of detail.The possibility to include large data into multilingual training set was also investigated. Our experiments point out possible risk of over-weighting the NNs towards the source language with large data. This degrades the performance on part of the target languages, compared to the setting where the amounts of data per language are balanced.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Procedia Computer Science - Volume 81, 2016, Pages 15–22

نویسندگان

František Grézl, Ekaterina Egorova, Martin Karafiát,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : بررسی منابع داده بزرگ برای آموزش چند زبانه و پورت سیستم

دسترسی سریع

ارتباط

English Website