Article code: 559024
Journal code: 875034
Publication year: 2014
English article: 11-page PDF
Full-text version: free download
English title of the ISI article
Efficient data selection for speech recognition based on prior confidence estimation using speech and monophone models
Related topics
Engineering and Basic Sciences, Computer Engineering, Signal Processing
English abstract


• We propose a method to select highly accurate data for speech recognition.
• We rapidly estimate prior confidence before speech recognition.
• Our prior estimation uses the acoustic likelihood of speech and monophone models.
• The proposed technique is over fifty times faster than the conventional method.
• Our proposal provides equivalent data selection performance.

This paper proposes an efficient speech data selection technique that can identify those data that will be well recognized. Conventional confidence measure techniques can also identify well-recognized speech data. However, those techniques require a lot of computation time for speech recognition processing to estimate confidence scores. Speech data with low confidence should not go through the time-consuming recognition process since they will yield erroneous spoken documents that will eventually be rejected. The proposed technique can select the speech data that will be acceptable for speech recognition applications. It rapidly selects speech data with high prior confidence based on acoustic likelihood values and using only speech and monophone models. Experiments show that the proposed confidence estimation technique is over 50 times faster than the conventional posterior confidence measure while providing equivalent data selection performance for speech recognition and spoken document retrieval.
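
The selection idea in the abstract can be illustrated with a minimal sketch. The abstract does not give the scoring formula, so everything here is an assumption made for illustration: the score is taken to be the average per-frame gap between the best monophone log-likelihood and a generic speech-model log-likelihood, and the names `Utterance`, `prior_confidence`, `select_for_recognition`, and the threshold value are hypothetical. The point is only that utterances are filtered on a cheap acoustic score before any full recognition pass is run.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class Utterance:
    utt_id: str
    # Per-frame log-likelihoods, assumed to be precomputed by two
    # lightweight acoustic models (hypothetical inputs for this sketch).
    speech_loglik: Sequence[float]     # under a generic "speech" model
    monophone_loglik: Sequence[float]  # best-scoring monophone per frame


def prior_confidence(utt: Utterance) -> float:
    """Average per-frame log-likelihood gap.

    This scoring rule is an assumption for illustration; it is not
    taken from the paper.
    """
    n = len(utt.speech_loglik)
    if n == 0:
        raise ValueError("utterance has no frames")
    if n != len(utt.monophone_loglik):
        raise ValueError("frame counts differ between the two models")
    gap = sum(m - s for m, s in zip(utt.monophone_loglik, utt.speech_loglik))
    return gap / n


def select_for_recognition(utts: Sequence[Utterance], threshold: float) -> List[str]:
    """Return IDs of utterances whose prior confidence reaches the threshold."""
    return [u.utt_id for u in utts if prior_confidence(u) >= threshold]


# Toy data: the numbers are invented, not experimental results.
utts = [
    Utterance("clean", [-60.0, -62.0], [-55.0, -57.0]),  # average gap 5.0
    Utterance("noisy", [-60.0, -61.0], [-59.5, -60.5]),  # average gap 0.5
]
selected = select_for_recognition(utts, threshold=2.0)
print(selected)
```

Only the utterances returned by `select_for_recognition` would then be passed to the full recognizer. In practice the threshold would have to be tuned on held-out data.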

Publisher
Database: Elsevier - ScienceDirect
Journal: Computer Speech & Language - Volume 28, Issue 6, November 2014, Pages 1287–1297