دانلود رایگان مقاله: مدل سازی آکوستیک برای تشخیص گفتار در زبان هندی در زمینه کاری محصولات کشاورزی

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
567031	1452042	2014	14 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Acoustic modelling for speech recognition in Indian languages in an agricultural commodities task domain

ترجمه فارسی عنوان

مدل سازی آکوستیک برای تشخیص گفتار در زبان هندی در زمینه کاری محصولات کشاورزی

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

شناسایی خودکار گفتار، تشخیص گفتار چند زبانه، مدلسازی زیربخش، زبانهای تحت منابع محدود، نرمال صوتی

Automatic speech recognition - تشخیص گفتار خودکار Under-resourced languages - زبان های زیرزمینی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال

پیش نمایش مقاله

مدل سازی آکوستیک برای تشخیص گفتار در زبان هندی در زمینه کاری محصولات کشاورزی

چکیده انگلیسی

• “Real-world” data is used for speech recognition systems in 4 Indian languages.
• The subspace Gaussian mixture model is effective for insufficient training data.
• Cross-corpus acoustic mismatch is a serious issue for multi-lingual systems.
• Apparent cross-lingual phonetic similarities for Hindi and Marathi are “discovered”.

In developing speech recognition based services for any task domain, it is necessary to account for the support of an increasing number of languages over the life of the service. This paper considers a small vocabulary speech recognition task in multiple Indian languages. To configure a multi-lingual system in this task domain, an experimental study is presented using data from two linguistically similar languages – Hindi and Marathi. We do so by training a subspace Gaussian mixture model (SGMM) (Povey et al., 2011 and Rose et al., 2011) under a multi-lingual scenario (Burget et al., 2010 and Mohan et al., 2012a). Speech data was collected from the targeted user population to develop spoken dialogue systems in an agricultural commodities task domain for this experimental study. It is well known that acoustic, channel and environmental mismatch between data sets from multiple languages is an issue while building multi-lingual systems of this nature. As a result, we use a cross-corpus acoustic normalization procedure which is a variant of speaker adaptive training (SAT) (Mohan et al., 2012a). The resulting multi-lingual system provides the best speech recognition performance for both languages. Further, the effect of sharing “similar” context-dependent states from the Marathi language on the Hindi speech recognition performance is presented.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Speech Communication - Volume 56, January 2014, Pages 167–180

نویسندگان

Aanchan Mohan, Richard Rose, Sina Hamidi Ghalehjegh, S. Umesh,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : مدل سازی آکوستیک برای تشخیص گفتار در زبان هندی در زمینه کاری محصولات کشاورزی

دسترسی سریع

ارتباط

English Website