کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
455717 | 695535 | 2013 | 12 صفحه PDF | دانلود رایگان |

Recently, there has been a significant increase in studies employing auditory models in speech recognition systems. In this paper, we propose a new evolutionary tuned feature extraction method by spectro-temporal analysis. In our proposed model, there is a special subspace for each phoneme with a specific best scale in the spectral filter and a specific best rate in the temporal filter. These two parameters were obtained by genetic cellular automata evolutionary algorithm. The extracted features from the specific subspace are classified by a binary one-versus-rest support vector machine. Finally, a multiclass classifier for all phonemes is employed by combining these sub-models. The proposed method improved the discrimination of phonemes significantly especially in highly confusable phonemes. To show the efficiency of the proposed feature sets, it was empirically compared with two baseline models. The achieved relative improvements are about 10% in classification rate for voiced plosives, unvoiced plosives and nasals; and about 7.38% for front vowels relative to the state of the art baseline model.
Figure optionsDownload as PowerPoint slideHighlights
► We propose a new evolutionary tuned feature extraction method by spectro-temporal analysis.
► Each of the phonemes can well be discriminated in a very small subspace of high-dimensional cortical representation.
► The method improved the discrimination of phonemes significantly especially in highly confusable phonemes.
Journal: Computers & Electrical Engineering - Volume 39, Issue 5, July 2013, Pages 1537–1548