کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
536492 870544 2011 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A new multi-purpose audio-visual UNMC-VIER database with multiple variabilities
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
پیش نمایش صفحه اول مقاله
A new multi-purpose audio-visual UNMC-VIER database with multiple variabilities
چکیده انگلیسی

Audio-visual recognition system is becoming popular because it overcomes certain problems of traditional audio-only recognition system. However, difficulties due to visual variations in video sequence can significantly degrade the recognition performance of the system. This problem can be further complicated when more than one visual variation happen at the same time. Although several databases have been created in this area, none of them includes realistic visual variations in video sequence. With the aim to facilitate the development of robust audio-visual recognition systems, the new audio-visual UNMC-VIER database is created. This database contains various visual variations including illumination, facial expression, head pose, and image resolution variations. The most unique aspect of this database is that it includes more than one visual variation in the same video recording. For the audio part, the utterances are spoken in slow and normal speech pace to improve the learning process of audio-visual speech recognition system. Hence, this database is useful for the development of robust audio-visual person, speech recognition and face recognition systems.


► The UNMC-VIER database contains audio and video recordings of 123 subjects.
► This database includes illumination, facial expression, head pose, and image resolution variations.
► Facial expression and head pose variations are taken under different lighting conditions.
► High quality video camcorders and low quality webcam are used to create the image resolution variation.
► Recognition accuracy is degraded when more than one visual variability exist at the same time.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Pattern Recognition Letters - Volume 32, Issue 13, 1 October 2011, Pages 1503–1510
نویسندگان
, , , , , , ,