کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
563115 | 875471 | 2013 | 13 صفحه PDF | دانلود رایگان |
This study focuses on automatic visual speech recognition in the presence of noise. The authors show that, when speech is produced in noisy environments, articulatory changes occur because of the Lombard effect; these changes are both audible and visible. The authors analyze the visual Lombard effect and its role in automatic visual- and audiovisual speech recognition. Experimental results using both English and Japanese data demonstrate the negative effect of the Lombard effect in the visual speech domain. Without considering this factor in designing a lip-reading system, the performance of the system decreases. This is very important in audiovisual speech automatic recognition in real noisy environments. In such a case, however, the recognition rates decrease because of the presence of acoustic noise and because of the Lombard effect. The authors also show that the performance of an audiovisual speech recognizer depends also on the visual Lombard effect and can be further improved when it is considered in designing such a system.
► When speech is occurred in noisy environment, Lombard effect appears.
► Recognition rates decrease also because of the presence of Lombard effect.
► The Lombard effect is also present in visual speech production.
► Automatic visual speech recognition rates decrease in the presence of audio noise.
► Visual speech recognition rates increases when considering Lombard effect.
Journal: Computer Speech & Language - Volume 27, Issue 1, January 2013, Pages 288–300