Using different acoustic, lexical and language modeling units for ASR of an under-resourced language

Article code	Journal code	Publication year	English article	Full-text version
567032	1452042	2014	14-page PDF	Free download

English title of the ISI article

Using different acoustic, lexical and language modeling units for ASR of an under-resourced language – Amharic


Keywords

Amharic; Speech recognition; Under-resourced languages

Related topics

Engineering and Basic Sciences; Computer Engineering; Signal Processing


English abstract

State-of-the-art large vocabulary continuous speech recognition systems mostly use phone-based acoustic models (AMs) and word-based lexical and language models. However, phone-based AMs are not efficient at modeling long-term temporal dependencies, and the use of words in lexical and language models leads to the out-of-vocabulary (OOV) problem, which is a serious issue for morphologically rich languages. This paper presents the results of our contributions on the use of different units for acoustic, lexical and language modeling for an under-resourced language (Amharic, spoken in Ethiopia). Triphone, syllable and hybrid (syllable-phone) units have been investigated for acoustic modeling. Words and morphemes have been investigated for lexical and language modeling. We have also investigated the use of longer (syllable) acoustic units and shorter (morpheme) lexical as well as language modeling units in a speech recognition system.

Although hybrid AMs did not bring much improvement over context-dependent syllable-based recognizers in speech recognition performance with word-based lexical and language models (i.e. word-based speech recognition), we observed a significant word error rate (WER) reduction compared to triphone-based systems in morpheme-based speech recognition. Syllable AMs also led to a WER reduction over the triphone-based systems in both word-based and morpheme-based speech recognition. It was possible to obtain a 3% absolute WER reduction as a result of using syllable acoustic units in morpheme-based speech recognition. Overall, our results show that syllable and hybrid AMs are best suited to morpheme-based speech recognition.
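The OOV problem mentioned in the abstract can be illustrated with a small sketch. It is not taken from the paper: the function names `oov_rate` and `segment`, the English toy words, and the suffix-stripping rule are all assumptions chosen only to show why sub-word (morpheme-like) units tend to cover unseen word forms better than whole words.

```python
def oov_rate(train_tokens, test_tokens):
    """Fraction of test tokens that never occur in the training tokens."""
    vocab = set(train_tokens)
    if not test_tokens:
        return 0.0
    unseen = sum(1 for tok in test_tokens if tok not in vocab)
    return unseen / len(test_tokens)


def segment(word, suffixes=("ing", "ed", "s")):
    """Hypothetical toy segmenter: split off one known suffix, if present.

    A real morphological segmenter is far more involved; this is only
    a stand-in for illustration.
    """
    for suf in suffixes:
        if word.endswith(suf) and len(word) > len(suf):
            return [word[: -len(suf)], "+" + suf]
    return [word]


# Toy data (assumed, not from the paper's corpus).
train_words = "walk walked talks jumping".split()
test_words = "walks talked jump".split()

# Same text, re-expressed in morpheme-like units.
train_morphs = [m for w in train_words for m in segment(w)]
test_morphs = [m for w in test_words for m in segment(w)]

word_oov = oov_rate(train_words, test_words)
morph_oov = oov_rate(train_morphs, test_morphs)
print(f"word-level OOV rate:     {word_oov:.2f}")
print(f"morpheme-level OOV rate: {morph_oov:.2f}")
```

In this toy setup every test word is unseen as a whole word, while all of its stems and suffixes were seen during training, which is the effect that motivates morpheme-based lexical and language models for morphologically rich languages.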

► The best acoustic, lexical and language modeling units have been investigated.
► Triphone, syllable and hybrid (phone-syllable) acoustic models have been developed.
► Words and morphemes have been used as lexical and language modeling units.
► Syllable and hybrid acoustic models outperformed the triphone based ones.
► The use of morphemes in lexical and language modeling led to improved performance.
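The word error rate (WER) used to compare the systems above is conventionally the word-level edit distance between the reference and the recognizer output, divided by the number of reference words. The following is a minimal sketch of that standard definition, not the authors' scoring tool; the function names and the example sentences and rates are assumptions made for illustration. It also shows the difference between an absolute reduction (as in the "3% absolute" figure) and a relative one.

```python
def word_error_rate(reference, hypothesis):
    """WER = (substitutions + deletions + insertions) / reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # Levenshtein distance over words, keeping only two rows of the table.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1] / len(ref)


def absolute_reduction(baseline_wer, new_wer):
    """Difference in WER, e.g. 0.25 -> 0.22 is a 0.03 (3%) absolute reduction."""
    return baseline_wer - new_wer


def relative_reduction(baseline_wer, new_wer):
    """Reduction expressed as a fraction of the baseline WER."""
    return (baseline_wer - new_wer) / baseline_wer


# One substitution ("sat" -> "sit") and one deletion ("the"): 2 errors, 6 words.
wer = word_error_rate("the cat sat on the mat", "the cat sit on mat")
print(f"WER: {wer:.3f}")

# Hypothetical rates, chosen only to illustrate the two kinds of reduction.
print(f"absolute: {absolute_reduction(0.25, 0.22):.2f}")
print(f"relative: {relative_reduction(0.25, 0.22):.2f}")
```

For morpheme-based recognition, the recognized morphemes would normally be joined back into words before scoring so that the result is comparable with word-based systems; that step is omitted here.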

Publisher

Database: Elsevier - ScienceDirect
Journal: Speech Communication - Volume 56, January 2014, Pages 181–194

Authors

Martha Yifiru Tachbelie, Solomon Teferra Abate, Laurent Besacier
