دانلود رایگان مقاله: برای رسیدن به آرامش تشخیص گفتار اندونزیایی با مدلهای آکوستیک اقتباس شده سخنرانی آ

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
485451	703327	2016	7 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Towards Robust Indonesian Speech Recognition with Spontaneous-Speech Adapted Acoustic Models

ترجمه فارسی عنوان

برای رسیدن به آرامش تشخیص گفتار اندونزیایی با مدلهای آکوستیک اقتباس شده سخنرانی آ

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

spontaneous speech - گفتار خودبهخودی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر علوم کامپیوتر (عمومی)

پیش نمایش مقاله

برای رسیدن به آرامش تشخیص گفتار اندونزیایی با مدلهای آکوستیک اقتباس شده سخنرانی آ

چکیده انگلیسی

This paper presents our work in building an Indonesian speech recognizer to handle both spontaneous and dictated speech. The recognizer is based on the Gaussian Mixture and Hidden Markov Models (GMM-HMM). The model is first trained on 73 hours of dictated speech and 43.5 minutes of spontaneous speech. The dictated speech is read from prepared transcripts by a diverse group of 244 Indonesian speakers. The spontaneous speech is manually labelled from recordings of an Indonesian parliamentary meeting, and is interspersed with noises and fillers. The resulting triphone model is then adapted only to the spontaneous speech using the Maximum A-posteriori Probability (MAP) method. We evaluate the adapted model using separate dictated and spontaneous evaluation sets. The dictated set consists of speech from 20 speakers totaling 14.5 hours. The spontaneous set is derived from the recording of a regional government meeting, consisting of 1085 utterances totaling 48.5 minutes. Evaluation of a MAP-adapted spontaneous set yields a 2.60% absolute increase in Word Accuracy Rate (WAR) over the un-adapted model, outperforming MMI adaptation. Conversely, MMI adaption of the dictated set outperforms the MAP adaptation by achieving an absolute increase of 1.48% in WAR over the un-adapted model. We also demonstrate that fMLLR speaker adaptation is unsuitable for our task due to limited adaptation data.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Procedia Computer Science - Volume 81, 2016, Pages 167–173

نویسندگان

Devin Hoesen, Cil Hardianto Satriawan, Dessi Puji Lestari, Masayu Leylia Khodra,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : برای رسیدن به آرامش تشخیص گفتار اندونزیایی با مدلهای آکوستیک اقتباس شده سخنرانی آ

دسترسی سریع

ارتباط

English Website