دانلود رایگان مقاله: بهبود سیستم های تشخیص گفتار خودکار با استفاده از ویژگی های دینامیکی غیر خطی که از طرح تکرار سیگنال های گفتاری ارزیابی می شود

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
4955215	1444182	2017	12 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Improvement of automatic speech recognition systems via nonlinear dynamical features evaluated from the recurrence plot of speech signals

ترجمه فارسی عنوان

بهبود سیستم های تشخیص گفتار خودکار با استفاده از ویژگی های دینامیکی غیر خطی که از طرح تکرار سیگنال های گفتاری ارزیابی می شود

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Automatic speech recognition - تشخیص گفتار خودکار Recurrence plot - تکرار عادت Mel-frequency cepstral coefficients - ضرایب cepstral ملودی Reconstructed phase space - فاز بازسازی شده

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر شبکه های کامپیوتری و ارتباطات

پیش نمایش مقاله

بهبود سیستم های تشخیص گفتار خودکار با استفاده از ویژگی های دینامیکی غیر خطی که از طرح تکرار سیگنال های گفتاری ارزیابی می شود

چکیده انگلیسی

- An effective algorithm is proposed for automatic speech recognition task using speech trajectories reconstructed in the phase space.
- The one-dimensional speech signal is converted into a two-dimensional image for speech recognition.
- The performance of proposed method is kept in noisy conditions.

The spectral-based features, typically used in Automatic Speech Recognition (ASR) systems, reject the phase information of speech signals. Thus, employing extra features, in which the phase of the signal is not rejected, may fill this gap. Embedding the speech signal in the Reconstructed Phase Space (RPS) and then extracting some useful features from it, is a recently considered approach in this field. In this paper, we will follow this approach by evaluating some useful features from the Recurrence Plot (RP) of the embedded speech signals in the RPS; the proposed features are evaluated via applying a two-dimensional wavelet transform to the resulted RP diagrams. The proposed features are examined in an ASR task alone and in combination with the traditional Mel-Frequency Cepstral Coefficients (MFCC). For the second case, using English TIMIT corpus, 3.94% absolute classification accuracy improvement in the phoneme recognition accuracy rate, against using only the MFCC features is gained.

196

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computers & Electrical Engineering - Volume 58, February 2017, Pages 215-226

نویسندگان

Shabnam Gholamdokht Firooz, Farshad Almasganj, Yasser Shekofteh,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : بهبود سیستم های تشخیص گفتار خودکار با استفاده از ویژگی های دینامیکی غیر خطی که از طرح تکرار سیگنال های گفتاری ارزیابی می شود

دسترسی سریع

ارتباط

English Website