کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
566004 1452024 2016 16 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Noise robust exemplar matching with alpha–beta divergence
ترجمه فارسی عنوان
تطبیق الگو با رضایت شغلی با واگرایی آلفا بتا
کلمات کلیدی
شناسایی خودکار گفتار، استحکام نویز، تطبیق نمونه واگرا آلفا بتا، خطای بازسازی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال
چکیده انگلیسی


• Noisy speech is modeled as a linear combination of multiple-length exemplars.
• The proposed recognizer uses the alpha–beta divergence with two parameters.
• The parameters are automatically adjusted to obtain better separation.
• The adaptive noise modeling approach is effective on the genuine room noise.
• Proposed recognizer provides improved noise robustness on AURORA-2 and CHIME-2 data.

The noise robust exemplar matching (N-REM) framework performs automatic speech recognition using exemplars, which are the labeled spectrographic representations of speech segments extracted from training data. By incorporating a sparse representations formulation, this technique remedies the inherent noise modeling problem of conventional exemplar matching-based automatic speech recognition systems. In this framework, noisy speech segments are approximated as a sparse linear combination of the exemplars of multiple lengths, each associated with a single speech unit such as words, half-words or phones. On account of the reconstruction error-based back end, the recognition accuracy highly depends on the congruence of the speech features and the divergence metric used to compare the speech segments with exemplars. In this work, we replace the conventional Kullback–Leibler divergence (KLD) with a generalized divergence family called the Alpha–Beta divergence with two parameters, α and β  , in conjunction with mel-scaled magnitude spectral features. The proposed recognizer traverses the (α,βα,β) plane depending on the amount of contamination to provide better separation of speech and noise sources. Moreover, we apply our recently proposed active noise exemplar selection (ANES) technique in a more realistic scenario where the target utterances are degraded by genuine room noise. Recognition experiments on the small vocabulary track of the 2nd CHiME Challenge and the AURORA-2 database have shown that the novel recognizer with the AB divergence and ANES outperforms the baseline system using the generalized KLD with tuned sparsity, especially at lower SNR levels.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Speech Communication - Volume 76, February 2016, Pages 127–142
نویسندگان
, , ,