Free article download: Sparse coding over redundant dictionaries for fast adaptation of speech recognition system

Article ID	Journal ID	Publication year	English article	Full text
4973693	1451684	2017	17-page PDF	Free download

English title of the ISI article

Sparse coding over redundant dictionaries for fast adaptation of speech recognition system

Keywords

Fast adaptation, acoustic model interpolation, sparse coding, exemplar and learned speaker dictionaries

Related subjects

Engineering and basic sciences, computer engineering, signal processing

Article preview

English abstract

This work presents a novel use of sparse coding over a redundant dictionary for fast adaptation of the acoustic models in hidden Markov model-based automatic speech recognition (ASR) systems. It extends existing fast adaptation approaches based on acoustic model interpolation. In these methods, the basis (model) weights are estimated using an iterative procedure employing the maximum-likelihood (ML) criterion. For effective adaptation, a number of bases are typically selected; as a result, the latency of the iterative weight estimation becomes high for ASR tasks that involve human-machine interaction. To address this issue, we propose sparse coding of the target mean supervector over a speaker-specific (exemplar) redundant dictionary. In this approach, the greedy sparse coding not only selects the desired bases but also compresses them into a single supervector, which is then ML scaled to yield the adapted mean parameters. This reduces the latency of basis weight estimation compared with existing fast adaptation techniques. Further, to address the loss of information due to the reduced degrees of freedom, we extend the proposed approach using separate sparse codings over multiple (exemplar and learned) redundant dictionaries. In adapting an ASR task involving human-computer interaction, the proposed approach is found to be as effective as the existing techniques but with a substantial reduction in computational cost.
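The selection-and-compression step described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes plain matching pursuit as the greedy sparse coder (the abstract does not name the specific algorithm), uses a toy three-dimensional "supervector" and a four-atom dictionary invented for illustration, and leaves out the ML scaling step, which would need HMM statistics from the adaptation data. All function and variable names are hypothetical.

```python
import math


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def normalize(v):
    """Scale a vector to unit Euclidean norm."""
    norm = math.sqrt(dot(v, v))
    return [x / norm for x in v]


def matching_pursuit(target, atoms, n_select):
    """Greedy sparse coding of `target` over unit-norm `atoms`.

    Returns a dict mapping the index of each selected atom to its weight.
    Assumption: plain matching pursuit stands in for the greedy coder.
    """
    residual = list(target)
    weights = {}
    for _ in range(n_select):
        # Pick the atom most correlated with what is still unexplained.
        scores = [dot(residual, atom) for atom in atoms]
        best = max(range(len(atoms)), key=lambda i: abs(scores[i]))
        if abs(scores[best]) < 1e-12:
            break  # nothing left to explain
        weights[best] = weights.get(best, 0.0) + scores[best]
        residual = [r - scores[best] * a for r, a in zip(residual, atoms[best])]
    return weights


def compress(weights, atoms):
    """Collapse the selected, weighted atoms into one combined supervector."""
    combined = [0.0] * len(atoms[0])
    for index, weight in weights.items():
        for j, value in enumerate(atoms[index]):
            combined[j] += weight * value
    return combined


# Toy redundant dictionary: four atoms in a three-dimensional space.
dictionary = [
    normalize(v)
    for v in ([1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0], [1.0, 1.0, 0.0])
]
target = [3.0, 0.0, 1.0]  # stand-in for the target mean supervector

weights = matching_pursuit(target, dictionary, n_select=2)
approximation = compress(weights, dictionary)
```

In this toy setting two atoms suffice to reconstruct the target, so `approximation` matches `target`. In the approach the abstract describes, the combined supervector would then be scaled under the ML criterion rather than used directly.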

Publisher

Database: Elsevier - ScienceDirect
Journal: Computer Speech & Language - Volume 43, May 2017, Pages 1-17

Authors

S. Shahnawazuddin, Rohit Sinha
