Bioinspired sparse spectro-temporal representation of speech for robust classification

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
558436	874929	2012	13 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Sparse coding - کدینگ اسپارس یا کدگذاری تنک

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال

پیش نمایش صفحه اول مقاله

Bioinspired sparse spectro-temporal representation of speech for robust classification

چکیده انگلیسی

In this work, a first approach to a robust phoneme recognition task by means of a biologically inspired feature extraction method is presented. The proposed technique provides an approximation to the speech signal representation at the auditory cortical level. It is based on an optimal dictionary of atoms, estimated from auditory spectrograms, and the Matching Pursuit algorithm to approximate the cortical activations. This provides a sparse coding with intrinsic noise robustness, which can be therefore exploited when using the system in adverse environments. The recognition task consisted in the classification of a set of 5 easily confused English phonemes, in both clean and noisy conditions. Multilayer perceptrons were trained as classifiers and the performance was compared to other classic and robust parameterizations: the auditory spectrogram, a probabilistic optimum filtering on Mel frequency cepstral coefficients and the perceptual linear prediction coefficients. Results showed a significant improvement in the recognition rate of clean and noisy phonemes by the cortical representation over these other parameterizations.

► We address the problem of signal representation in the broad field of robust speech recognition.
► We propose a biologically inspired method to obtain the feature vectors by modeling the auditory cortical response to stimuli.
► The method calculates an optimal dictionary of atoms from the auditory spectrograms.
► The Matching Pursuit algorithm selects the more representative coefficients, thresholding the noisy components.
► For phoneme classification, our method outperforms the standard parameterizations used in automatic speech recognition.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computer Speech & Language - Volume 26, Issue 5, October 2012, Pages 336–348

نویسندگان

C. Martínez, J. Goddard, D. Milone, H. Rufiner,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Bioinspired sparse spectro-temporal representation of speech for robust classification

دسترسی سریع

ارتباط

English Website