Article code | Journal code | Year of publication | English article | Full-text version |
---|---|---|---|---|
565923 | 1452041 | 2014 | 11-page PDF | Free download |
• A significant reduction in word error rate (WER) compared to random channel selection is reported.
• Automatic speech recognition may benefit from channel selection.
• An overview of channel selection measures is presented, and new measures are proposed.
• A measure based on the variance of the speech intensity envelope performs well.
• Combination of the measures brings further improvements.
Automatic speech recognition in a room with distant microphones is strongly affected by noise and reverberation. In scenarios where the speech signal is captured by several arbitrarily located microphones, the degree of distortion differs from one channel to another. In this work we deal with measures extracted from a given distorted signal that either estimate its quality or measure how well it fits the acoustic models of the recognition system. We then apply them to the problem of selecting the signal (i.e., the channel) that presumably leads to the lowest recognition error rate. New channel selection techniques are presented and compared experimentally, in reverberant environments, with other approaches reported in the literature. Significant improvements in recognition rate are observed for most of the measures. A new measure based on the variance of the speech intensity envelope shows a good trade-off between recognition accuracy, latency and computational cost. In addition, combining measures yields a further improvement in recognition rate.
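
To make the idea of signal-based channel selection concrete, the following is a minimal sketch of an envelope-variance style score. It is not the measure defined in the paper, whose details the abstract does not give. It assumes, for illustration only, a full-band short-time energy envelope, mean removal in the log domain (so the score does not depend on channel gain), cube-root compression, and the intuition that reverberation smooths the envelope and so lowers its variance. The function names, frame sizes and the compression exponent are all hypothetical choices.

```python
import math
from statistics import pvariance


def intensity_envelope(signal, frame_len=400, hop=160):
    """Short-time mean energy per frame (a crude intensity envelope).

    frame_len and hop are illustrative defaults (25 ms / 10 ms at 16 kHz).
    """
    return [
        sum(x * x for x in signal[start:start + frame_len]) / frame_len
        for start in range(0, len(signal) - frame_len + 1, hop)
    ]


def envelope_variance(signal, frame_len=400, hop=160, floor=1e-12):
    """Variance of the gain-normalized, compressed intensity envelope.

    Assumption: a larger value suggests a less reverberant channel.
    """
    env = intensity_envelope(signal, frame_len, hop)
    if not env:
        raise ValueError("signal is shorter than one frame")
    # Work in the log domain so a constant channel gain becomes an offset.
    log_env = [math.log(max(e, floor)) for e in env]
    mean_log = sum(log_env) / len(log_env)
    # Remove the offset, return to the linear domain, then compress
    # with a cube root (an assumed, illustrative compression).
    compressed = [math.exp(v - mean_log) ** (1.0 / 3.0) for v in log_env]
    return pvariance(compressed)


def select_channel(channels):
    """Return (index of the highest-scoring channel, list of all scores)."""
    scores = [envelope_variance(ch) for ch in channels]
    best = max(range(len(scores)), key=scores.__getitem__)
    return best, scores
```

Calling `select_channel` with a list of per-microphone sample sequences returns the index of the channel with the largest score together with all scores. The scores could then be combined with other measures, for example by averaging normalized values, although how the paper combines measures is not stated in the abstract.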
Journal: Speech Communication - Volume 57, February 2014, Pages 170–180