Analysis of multimodal sequences using geometric video representations

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
564965	875663	2006	15 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال

پیش نمایش صفحه اول مقاله

Analysis of multimodal sequences using geometric video representations

چکیده انگلیسی

This paper presents a novel method to correlate audio and visual data generated by the same physical phenomenon, based on sparse geometric representation of video sequences. The video signal is modeled as a sum of geometric primitives evolving through time, that jointly describe the geometric and motion content of the scene. The displacement through time of relevant visual features, like the mouth of a speaker, can thus be compared with the evolution of an audio feature to assess the correspondence between acoustic and visual signals. Experiments show that the proposed approach allows to localize and track the speaker's mouth when several persons are present on the scene, in presence of distracting motion, and without prior face or mouth detection.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Signal Processing - Volume 86, Issue 12, December 2006, Pages 3534–3548

نویسندگان

Gianluca Monaci, Òscar Divorra Escoda, Pierre Vandergheynst,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Analysis of multimodal sequences using geometric video representations

دسترسی سریع

ارتباط

English Website