دانلود رایگان مقاله: از ویژگی های نقشه برداری با استفاده از میکروفن دور میدان برای تشخیص گفتار دور

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
568460	1452017	2016	9 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Feature mapping using far-field microphones for distant speech recognition

ترجمه فارسی عنوان

از ویژگی های نقشه برداری با استفاده از میکروفن دور میدان برای تشخیص گفتار دور

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

شبکه عصبی عمیق; ویژگی های تنگنا; تشخیص گفتار دور; جلسات; جسم AMI

Distant speech recognition Bottleneck features Meetings - جلسات Deep neural network - شبکه عصبی عمیق

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال

پیش نمایش مقاله

از ویژگی های نقشه برداری با استفاده از میکروفن دور میدان برای تشخیص گفتار دور

چکیده انگلیسی

• A nonlinear DNN bottleneck feature mapping using deep neural network is proposed.
• Shows that the feature mapping improves distant speech recognition performance.
• Shows that the feature mapping is complementary to fMLLR for speaker adaptation.
• Shows that the feature mapping generalizes to unseen conditions.
• Shows that DNN bottleneck features from a multi-condition network are robust to noise.

Acoustic modeling based on deep architectures has recently gained remarkable success, with substantial improvement of speech recognition accuracy in several automatic speech recognition (ASR) tasks. For distant speech recognition, the multi-channel deep neural network based approaches rely on the powerful modeling capability of deep neural network (DNN) to learn suitable representation of distant speech directly from its multi-channel source. In this model-based combination of multiple microphones, features from each channel are concatenated and used together as an input to DNN. This allows integrating the multi-channel audio for acoustic modeling without any pre-processing steps. Despite powerful modeling capabilities of DNN, an environmental mismatch due to noise and reverberation may result in severe performance degradation when features are simply fed to a DNN without a feature enhancement step. In this paper, we introduce the nonlinear bottleneck feature mapping approach using DNN, to transform the noisy and reverberant features to its clean version. The bottleneck features derived from the DNN are used as a teacher signal because they contain relevant information to phoneme classification, and the mapping is performed with the objective of suppressing noise and reverberation. The individual and combined impacts of beamforming and speaker adaptation techniques along with the feature mapping are examined for distant large vocabulary speech recognition, using a single and multiple far-field microphones. As an alternative to beamforming, experiments with concatenating multiple channel features are conducted. The experimental results on the AMI meeting corpus show that the feature mapping, used in combination with beamforming and speaker adaptation yields a distant speech recognition performance below 50% word error rate (WER), using DNN for acoustic modeling.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Speech Communication - Volume 83, October 2016, Pages 1–9

نویسندگان

Ivan Himawan, Petr Motlicek, David Imseng, Sridha Sridharan,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : از ویژگی های نقشه برداری با استفاده از میکروفن دور میدان برای تشخیص گفتار دور

دسترسی سریع

ارتباط

English Website