Free article download: Ensemble of deep neural networks using acoustic environment classification for statistical model-based voice activity detection

Article code	Journal code	Publication year	English article	Full-text version
558198	1451689	2016	12-page PDF	Free download

English title of the ISI article

Ensemble of deep neural networks using acoustic environment classification for statistical model-based voice activity detection

Keywords

Voice activity detection; Statistical model; Acoustic environment classification; Deep neural network; Ensemble

Related topics

Engineering and Basic Sciences, Computer Engineering, Signal Processing

Abstract

• We develop a voice activity detection method based on a statistical model.
• DNNs are used for the voice activity detection.
• An ensemble of DNNs is devised for different noise environments.
• A separate DNN is built to detect the current environment.

In this paper, we investigate an ensemble of deep neural networks (DNNs) that uses an acoustic environment classification (AEC) technique for statistical model-based voice activity detection (VAD). In statistical model-based VAD, the traditional decision rule is based on the geometric mean of the likelihood ratios or on a support vector machine (SVM), both of which are shallow models with zero or one hidden layer. Because shallow models cannot take advantage of the diversity of the space distribution of the features, in the training step we build multiple DNNs according to the different noise types by employing the parameters of the statistical model-based VAD algorithm. In addition, a separate DNN is designed for the AEC algorithm in order to choose the best DNN for each noise type. In the online noise-aware VAD step, AEC is first performed frame by frame using the separate DNN, which yields the a posteriori probability of each noise type. Once these probabilities are obtained, the environmental knowledge is used to combine the speech presence probabilities derived from the ensemble of DNNs trained for the individual noise types. Our VAD approach was evaluated in terms of objective measures and showed significant improvement over the conventional algorithm.
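The online step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact combination rule, so the posterior-weighted sum below is an assumption, as are the function names, the three example noise types, and the 0.5 decision threshold; none of them are taken from the paper.

```python
from typing import Sequence


def combine_speech_probabilities(
    env_posteriors: Sequence[float],
    speech_probs: Sequence[float],
) -> float:
    """Combine per-noise speech presence probabilities for one frame.

    env_posteriors: AEC posterior probability of each noise type.
    speech_probs:   speech presence probability from the DNN trained
                    on the corresponding noise type.
    Assumption: a posterior-weighted sum; the paper may use another rule.
    """
    if len(env_posteriors) != len(speech_probs):
        raise ValueError("one speech probability is needed per noise type")
    total = sum(env_posteriors)
    if total <= 0.0:
        raise ValueError("environment posteriors must have positive mass")
    weighted = sum(p * s for p, s in zip(env_posteriors, speech_probs))
    # Renormalise in case the posteriors do not sum exactly to one.
    return weighted / total


def detect_voice(
    env_posteriors: Sequence[float],
    speech_probs: Sequence[float],
    threshold: float = 0.5,  # hypothetical threshold, not from the paper
) -> bool:
    """Return True when the combined probability reaches the threshold."""
    combined = combine_speech_probabilities(env_posteriors, speech_probs)
    return combined >= threshold


# Hypothetical frame: the AEC network favours the first noise type.
frame_env = [0.7, 0.2, 0.1]      # e.g. babble, car, white noise (assumed)
frame_speech = [0.9, 0.4, 0.2]   # outputs of the three noise-specific DNNs
frame_prob = combine_speech_probabilities(frame_env, frame_speech)
frame_is_speech = detect_voice(frame_env, frame_speech)
```

For this hypothetical frame the combined speech presence probability is roughly 0.73, so the frame is labelled as speech. A soft weighting like this lets uncertain environment estimates blend several noise-specific detectors instead of committing to a single one.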

Publisher

Database: Elsevier - ScienceDirect
Journal: Computer Speech & Language - Volume 38, July 2016, Pages 1–12

Authors

Inyoung Hwang, Hyung-Min Park, Joon-Hyuk Chang
