کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
4955215 | 1444182 | 2017 | 12 صفحه PDF | دانلود رایگان |
- An effective algorithm is proposed for automatic speech recognition task using speech trajectories reconstructed in the phase space.
- The one-dimensional speech signal is converted into a two-dimensional image for speech recognition.
- The performance of proposed method is kept in noisy conditions.
The spectral-based features, typically used in Automatic Speech Recognition (ASR) systems, reject the phase information of speech signals. Thus, employing extra features, in which the phase of the signal is not rejected, may fill this gap. Embedding the speech signal in the Reconstructed Phase Space (RPS) and then extracting some useful features from it, is a recently considered approach in this field. In this paper, we will follow this approach by evaluating some useful features from the Recurrence Plot (RP) of the embedded speech signals in the RPS; the proposed features are evaluated via applying a two-dimensional wavelet transform to the resulted RP diagrams. The proposed features are examined in an ASR task alone and in combination with the traditional Mel-Frequency Cepstral Coefficients (MFCC). For the second case, using English TIMIT corpus, 3.94% absolute classification accuracy improvement in the phoneme recognition accuracy rate, against using only the MFCC features is gained.
196
Journal: Computers & Electrical Engineering - Volume 58, February 2017, Pages 215-226