دانلود رایگان مقاله: انتخاب کانال فاصله Cepstral برای تشخیص گفتار از راه دور

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
4973652	1451680	2018	19 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Cepstral distance based channel selection for distant speech recognition

ترجمه فارسی عنوان

انتخاب کانال فاصله Cepstral برای تشخیص گفتار از راه دور

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

تشخیص گفتار دور انتخاب کانال؛ فاصله Cepstral؛ بازتاب مستقیم نسبت به بازتابنده؛ T60

Distant speech recognition Channel selection - انتخاب کانال Reverberation - بازتاب

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال

پیش نمایش مقاله

انتخاب کانال فاصله Cepstral برای تشخیص گفتار از راه دور

چکیده انگلیسی

- This work concerns distant speech recognition (DSR) with microphones sparse in space.
- We introduce a methodology to conduct studies on channel selection (CS) for DSR.
- A CS method is proposed and relies on cepstral distance as measure of signal quality.
- Experiments are conducted on both simulated and real multi-microphone data sets.
- Results demonstrate the effectiveness of the proposed methodology and techniques.

Shifting from a single to a multi-microphone setting, distant speech recognition can be benefited from the multiple instances of the same utterance in many ways. An effective approach, especially when microphones are not organized in an array fashion, is given by channel selection (CS), which assumes that for each utterance there is at least one channel that can improve the recognition results when compared to the decoding of the remaining channels. In order to identify this most favourable channel, a possible approach is to estimate the degree of distortion that characterizes each microphone signal. In a reverberant environment, this distortion can vary significantly across microphones, for instance due to the orientation of the speaker's head. In this work, we investigate on the application of cepstral distance as a distortion measure that turns out to be closely related to properties of the room acoustics, such as reverberation time and direct-to-reverberant ratio. From this measure, a blind CS method is derived, which relies on a reference computed by averaging log magnitude spectra of all the microphone signals. Another aim of our study is to propose a novel methodology to analyze CS under a wide set of experimental conditions and setup variations, which depend on the sound source position, its orientation, and the microphone network configuration. Based on the use of prior information, we introduce an informed technique to predict CS performance. Experimental results show both the effectiveness of the proposed blind CS method and the value of the aforementioned analysis methodology. The experiments were conducted using different sets of real and simulated data, the latter ones derived from synthetic and from measured impulse responses. It is demonstrated that the proposed blind CS method is well related to the oracle selection of the best recognized channel. Moreover, our method outperforms a state-of-the-art one, especially on real data.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computer Speech & Language - Volume 47, January 2018, Pages 314-332

نویسندگان

Cristina Guerrero Flores, Georgina Tryfou, Maurizio Omologo,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : انتخاب کانال فاصله Cepstral برای تشخیص گفتار از راه دور

دسترسی سریع

ارتباط

English Website