Free article download: Improving speech recognition using data augmentation and acoustic model fusion

Article ID	Journal ID	Publication year	English article	Full-text version
4960609	1446503	2017	7-page PDF	Free download

English title of the ISI article

Improving speech recognition using data augmentation and acoustic model fusion



Keywords

Speech recognition, deep learning, data augmentation, ensemble method, linear logistic regression

Related topics

Engineering and Basic Sciences, Computer Engineering, Computer Science (General)


Abstract

Deep-learning-based systems have greatly improved performance in speech recognition tasks, and various deep architectures and learning methods have been developed in the last few years. In parallel, Data Augmentation (DA), a common strategy for increasing the quantity of training data, has been shown to help neural networks learn to make invariant predictions. Ensemble Method (EM) approaches have also received considerable attention in the machine learning community as a way to increase the effectiveness of classifiers. We therefore propose a new Deep Neural Network (DNN) speech recognition architecture that takes advantage of both DA and EM approaches to improve the prediction accuracy of the system. We first explore an existing approach based on vocal tract length perturbation, and then propose a different DA technique based on feature perturbation to create modified training data sets. Finally, EM techniques are used to integrate the posterior probabilities produced by different DNN acoustic models trained on different data sets. Experimental results demonstrate an increase in the recognition performance of the proposed system.
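
The abstract does not say how the features are perturbed or which rule combines the posteriors, so the following is only a minimal sketch under stated assumptions. It assumes additive Gaussian noise as the feature perturbation and a normalized weighted average as the fusion rule; it does not implement the linear logistic regression listed in the keywords. The names `perturb_features`, `fuse_posteriors`, `noise_std`, and `weights` are hypothetical and do not come from the paper.

```python
import random


def perturb_features(frames, noise_std=0.05, seed=None):
    """Return a noisy copy of `frames` (a list of per-frame feature vectors).

    Assumption: additive zero-mean Gaussian noise stands in for the paper's
    feature perturbation, whose exact form the abstract does not give.
    """
    rng = random.Random(seed)
    return [[x + rng.gauss(0.0, noise_std) for x in frame] for frame in frames]


def fuse_posteriors(model_posteriors, weights=None):
    """Fuse per-frame class posteriors from several acoustic models.

    `model_posteriors[m][t][k]` is model m's posterior for class k at frame t.
    Assumption: a weighted average, renormalized per frame, stands in for
    the paper's ensemble rule.
    """
    n_models = len(model_posteriors)
    if weights is None:
        weights = [1.0] * n_models  # equal trust in every model
    total = sum(weights)
    weights = [w / total for w in weights]

    fused = []
    for frame_group in zip(*model_posteriors):  # same frame from each model
        n_classes = len(frame_group[0])
        mixed = [
            sum(w * post[k] for w, post in zip(weights, frame_group))
            for k in range(n_classes)
        ]
        norm = sum(mixed)
        fused.append([m / norm for m in mixed])
    return fused


if __name__ == "__main__":
    # Toy posteriors for two models, two frames, three classes.
    p1 = [[0.7, 0.2, 0.1], [0.1, 0.8, 0.1]]
    p2 = [[0.5, 0.3, 0.2], [0.2, 0.2, 0.6]]
    fused = fuse_posteriors([p1, p2])
    decisions = [max(range(len(f)), key=f.__getitem__) for f in fused]
    print(fused, decisions)

    # A perturbed copy of a toy feature matrix, as one extra training set.
    augmented = perturb_features([[1.0, 2.0], [3.0, 4.0]], noise_std=0.05, seed=0)
    print(augmented)
```

In a setup like the one the abstract describes, each acoustic model would be trained on a differently perturbed copy of the training data, and the fused posteriors would then be passed to the decoder in place of a single model's output.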

Publisher

Database: Elsevier - ScienceDirect
Journal: Procedia Computer Science - Volume 112, 2017, Pages 316-322

Authors

Ilyes Rebai, Yessine BenAyed, Walid Mahdi, Jean-Pierre Lorré
