دانلود رایگان مقاله: در مورد استفاده از شبکه های عصبی فیدر عمیق برای شناسایی خودکار زبان

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
6951554	1451687	2016	23 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

On the use of deep feedforward neural networks for automatic language identification

ترجمه فارسی عنوان

در مورد استفاده از شبکه های عصبی فیدر عمیق برای شناسایی خودکار زبان

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

LID DNN I-vectors Bottleneck - تنگنا

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال

پیش نمایش مقاله

در مورد استفاده از شبکه های عصبی فیدر عمیق برای شناسایی خودکار زبان

چکیده انگلیسی

In this work, we present a comprehensive study on the use of deep neural networks (DNNs) for automatic language identification (LID). Motivated by the recent success of using DNNs in acoustic modeling for speech recognition, we adapt DNNs to the problem of identifying the language in a given utterance from its short-term acoustic features. We propose two different DNN-based approaches. In the first one, the DNN acts as an end-to-end LID classifier, receiving as input the speech features and providing as output the estimated probabilities of the target languages. In the second approach, the DNN is used to extract bottleneck features that are then used as inputs for a state-of-the-art i-vector system. Experiments are conducted in two different scenarios: the complete NIST Language Recognition Evaluation dataset 2009 (LRE'09) and a subset of the Voice of America (VOA) data from LRE'09, in which all languages have the same amount of training data. Results for both datasets demonstrate that the DNN-based systems significantly outperform a state-of-art i-vector system when dealing with short-duration utterances. Furthermore, the combination of the DNN-based and the classical i-vector system leads to additional performance improvements (up to 45% of relative improvement in both EER and Cavg on 3s and 10s conditions, respectively).

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computer Speech & Language - Volume 40, November 2016, Pages 46-59

نویسندگان

Ignacio Lopez-Moreno, Javier Gonzalez-Dominguez, David Martinez, OldÅich Plchot, Joaquin Gonzalez-Rodriguez, Pedro J. Moreno,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : در مورد استفاده از شبکه های عصبی فیدر عمیق برای شناسایی خودکار زبان

دسترسی سریع

ارتباط

English Website