کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
529564 869675 2011 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Efficient video coding based on audio-visual focus of attention
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
پیش نمایش صفحه اول مقاله
Efficient video coding based on audio-visual focus of attention
چکیده انگلیسی

This paper proposes an efficient video coding method using audio-visual focus of attention, which is based on the observation that sound-emitting regions in an audio-visual sequence draw viewers’ attention. First, an audio-visual source localization algorithm is presented, where the sound source is identified by using the correlation between the sound signal and the visual motion information. The localization result is then used to encode different regions in the scene with different quality in such a way that regions close to the source are encoded with higher quality than those far from the source. This is implemented in the framework of H.264/AVC by assigning different quantization parameters for different regions. Through experiments with both standard and high definition sequences, it is demonstrated that the proposed method can yield considerable coding gains over the constant quantization mode of H.264/AVC without noticeable degradation of perceived quality.

Research highlights
► Efficient video coding using audio-visual focus of attention is proposed.
► An audio-visual source localization method is presented.
► Regions far from the source in the scene is encoded with lower quality in H.264/AVC.
► The effectiveness of the method is shown via objective and subjective experiments.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Visual Communication and Image Representation - Volume 22, Issue 8, November 2011, Pages 704–711
نویسندگان
, , ,