Integration of beamforming and uncertainty-of-observation techniques for robust ASR in multi-source environments

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
558361	874908	2013	14 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Uncertainty propagation - انتشار عدم اطمینان Robust speech recognition - شناسایی قوی سخنرانی Beamforming - شکل‌دهی پرتو Uncertainty decoding - عدم قطعیت رمزگشایی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال

پیش نمایش صفحه اول مقاله

Integration of beamforming and uncertainty-of-observation techniques for robust ASR in multi-source environments

چکیده انگلیسی

This paper presents a new approach for increasing the robustness of multi-channel automatic speech recognition in noisy and reverberant multi-source environments. The proposed method uses uncertainty propagation techniques to dynamically compensate the speech features and the acoustic models for the observation uncertainty determined at the beamforming stage. We present and analyze two methods that allow integrating classical multi-channel signal processing approaches like delay and sum beamformers or Zelinski-type Wiener filters, with uncertainty-of-observation techniques like uncertainty decoding or modified imputation. An analysis of the results on the PASCAL-CHiME task shows that this approach consistently outperforms conventional beamformers with a minimal increase in computational complexity. The use of dynamic compensation based on observation uncertainty also outperforms conventional static adaptation with no need of adaptation data.

► We address the integration of beamforming and automatic speech recognition.
► We propose propagating the uncertainty at the beamforming stage to the feature domain.
► The methods compute uncertainty for delay-and-sum and Zelinski–Wiener beamformers.
► Resulting algorithms provide increased performance at low computational cost.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computer Speech & Language - Volume 27, Issue 3, May 2013, Pages 837–850

نویسندگان

Ramón Fernandez Astudillo, Dorothea Kolossa, Alberto Abad, Steffen Zeiler, Rahim Saeidi, Pejman Mowlaee, João Paulo da Silva Neto, Rainer Martin,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Integration of beamforming and uncertainty-of-observation techniques for robust ASR in multi-source environments

دسترسی سریع

ارتباط

English Website