Dominant speaker identification for multipoint videoconferencing

Article code	Journal code	Publication year	English article
558324	874902	2013	16 pages (PDF)

Keywords

Acoustic noise; Videoconference; Speech processing

Related topics

Engineering and Basic Sciences; Computer Engineering; Signal Processing

Abstract

A multipoint conference is an efficient and cost-effective substitute for a face-to-face meeting. It involves three or more participants in separate locations, each using a single microphone and camera. Routing and processing the audiovisual information places heavy demands on the network, which raises a need to reduce the amount of information that flows through the system. One solution is to identify the dominant speaker and partially discard information originating from non-active participants. We propose a novel method for dominant speaker identification using speech activity information from time intervals of different lengths. The proposed method processes the audio signal of each participant independently and computes speech activity scores for the immediate, medium and long time intervals. These scores are compared and the dominant speaker is identified. In comparison to other speaker selection methods, experimental results demonstrate a reduction in the number of false speaker switches and improved robustness to transient audio interferences.

► We propose dominant speaker identification based on intervals of different lengths.
► Speech activity scores are computed for each participant in the videoconference.
► The scores are compared, and the dominant speaker is identified.
► Experimental results demonstrate robustness to transient audio interferences.
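
The abstract describes scoring each participant's speech activity over immediate, medium and long intervals and comparing the scores. The following Python sketch illustrates that general idea only; it is not the authors' algorithm. The window lengths, the margin, the use of per-frame boolean activity flags, and the names `activity_scores` and `pick_dominant` are all assumptions made for illustration.

```python
from typing import Dict, List, Sequence

# Hypothetical window lengths (in frames) for the immediate, medium and
# long intervals, and a hypothetical switching margin. Not from the paper.
WINDOWS = (5, 25, 100)
MARGIN = 0.2


def activity_scores(active: Sequence[bool], windows=WINDOWS) -> List[float]:
    """Return, for each window, the fraction of speech-active frames
    among the most recent `w` frames. Missing history counts as inactive."""
    scores = []
    for w in windows:
        recent = active[-w:]
        scores.append(sum(recent) / w)
    return scores


def pick_dominant(history: Dict[str, Sequence[bool]], current: str,
                  margin: float = MARGIN) -> str:
    """Keep the current speaker unless a challenger's score exceeds the
    current speaker's score by at least `margin` on every time scale."""
    base = activity_scores(history[current])
    best, best_gain = current, 0.0
    for name, frames in history.items():
        if name == current:
            continue
        cand = activity_scores(frames)
        # Smallest advantage of the challenger across the three scales.
        gain = min(c - b for c, b in zip(cand, base))
        if gain >= margin and gain > best_gain:
            best, best_gain = name, gain
    return best
```

Requiring an advantage on all three scales is one simple way to obtain the behaviour the highlights mention: a short burst from another participant raises only the immediate score, so it does not trigger a switch, whereas sustained speech eventually dominates every window.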

Publisher

Database: Elsevier - ScienceDirect
Journal: Computer Speech & Language - Volume 27, Issue 4, June 2013, Pages 895–910

Authors

Ilana Volfin, Israel Cohen
