کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
558370 874913 2010 15 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Monaural speech separation based on MAXVQ and CASA for robust speech recognition
کلمات کلیدی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال
پیش نمایش صفحه اول مقاله
Monaural speech separation based on MAXVQ and CASA for robust speech recognition
چکیده انگلیسی

Robustness is one of the most important topics for automatic speech recognition (ASR) in practical applications. Monaural speech separation based on computational auditory scene analysis (CASA) offers a solution to this problem. In this paper, a novel system is presented to separate the monaural speech of two talkers. Gaussian mixture models (GMMs) and vector quantizers (VQs) are used to learn the grouping cues on isolated clean data for each speaker. Given an utterance, speaker identification is firstly performed to identify the two speakers presented in the utterance, then the factorial-max vector quantization model (MAXVQ) is used to infer the mask signals and finally the utterance of the target speaker is resynthesized in the CASA framework. Recognition results on the 2006 speech separation challenge corpus prove that this proposed system can improve the robustness of ASR significantly.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computer Speech & Language - Volume 24, Issue 1, January 2010, Pages 30–44
نویسندگان
, , , , ,