کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
564964 875663 2006 16 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Audio-visual perception of a lecturer in a smart seminar room
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال
پیش نمایش صفحه اول مقاله
Audio-visual perception of a lecturer in a smart seminar room
چکیده انگلیسی

In this paper we present our work on audio-visual perception of a lecturer in a smart seminar room, which is equipped with various cameras and microphones. We present a novel approach to track the lecturer based on visual and acoustic observations in a particle filter framework. This approach does not require explicit triangulation of observations in order to estimate the 3D location of the lecturer, thus allowing for fast audio-visual tracking. We also show how automatic recognition of the lecturer's speech from far-field microphones can be improved using his or her tracked location in the room. Based on the tracked location of the lecturer, we can also detect his or her face in the various camera views for further analysis, such as his or her head orientation and identity. The paper describes the overall system and the various components (tracking, speech recognition, head orientation, identification) in detail and presents results on several multimodal recordings of seminars.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Signal Processing - Volume 86, Issue 12, December 2006, Pages 3518–3533
نویسندگان
, , , , , , ,