کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
529136 869632 2015 15 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Geometrical-based lip-reading using template probabilistic multi-dimension dynamic time warping
ترجمه فارسی عنوان
خواندن چهره با استفاده از هندسه با استفاده از قالب چندین بعدی احتمال انحراف زمان دینامیکی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
چکیده انگلیسی


• A lip reading system that uses geometrical features is proposed.
• We describe a new robust and computationally inexpensive lip feature extraction method.
• A novel classification approach gives significant performance improvements.

By identifying lip movements and characterizing their associations with speech sounds, the performance of speech recognition systems can be improved, particularly when operating in noisy environments. In this paper, we present a geometrical-based automatic lip reading system that extracts the lip region from images using conventional techniques, but the contour itself is extracted using a novel application of a combination of border following and convex hull approaches. Classification is carried out using an enhanced dynamic time warping technique that has the ability to operate in multiple dimensions and a template probability technique that is able to compensate for differences in the way words are uttered in the training set. The performance of the new system has been assessed in recognition of the English digits 0 to 9 as available in the CUAVE database. The experimental results obtained from the new approach compared favorably with those of existing lip reading approaches, achieving a word recognition accuracy of up to 71% with the visual information being obtained from estimates of lip height, width and their ratio.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Visual Communication and Image Representation - Volume 30, July 2015, Pages 219–233
نویسندگان
, ,