Article code | Journal code | Publication year | English article | Full-text version
6856229 | 1437949 | 2018 | 35-page PDF | Free download
English title of the ISI article
Dominant speaker detection in multipoint video communication using Markov chain with non-linear weights and dynamic transition window
Persian translation of the title
تشخیص بلندگو غالب در ارتباطات چند رسانه ای با استفاده از زنجیره مارکوف با وزن غیر خطی و پنجره انتقال پویا
Keywords
Markov chain, dominant speaker detection, multipoint communication
Related topics
Engineering and Basic Sciences, Computer Engineering, Artificial Intelligence
English abstract
This paper proposes an enhanced discrete-time Markov chain algorithm for predicting the dominant speaker(s) in a multipoint video communication system in the presence of transient speech. The proposed algorithm exploits statistical properties of past speech patterns to accurately predict the dominant speaker for the next time state. Non-linear weight-based coefficients are employed in the enhanced Markov chain for both the initial state vector and the transition probability matrix. These weights significantly reduce the time taken to predict a new dominant speaker during a conference session. In addition, a mechanism that dynamically modifies the size of the transition probability matrix window/container is introduced to improve the adaptability of the Markov chain to the variability of speech characteristics. Simulation results indicate that, for a test scenario with 11 conference participants, the enhanced Markov chain prediction algorithm achieved 85% accuracy in predicting a dominant speaker when compared with an ideal case in which there is no transient speech. Misclassification of dominant speakers due to transient speech was also reduced by 87%.
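The abstract does not give the weighting formula, so the following Python sketch only illustrates the general idea of a recency-weighted Markov chain predictor. The exponential weights, the `alpha` parameter, the uniform fallback for unseen speakers, and the function name `predict_next_speaker` are all assumptions made for illustration and are not taken from the paper. The dynamic window-resizing mechanism is not modelled here; the window is simply whatever `history` the caller passes in.

```python
import numpy as np

def predict_next_speaker(history, n_speakers, alpha=0.5):
    """Predict the next dominant speaker from a window of past speaker indices.

    history    : sequence of speaker indices, oldest first (the window).
    n_speakers : number of conference participants (Markov states).
    alpha      : hypothetical growth rate of the exponential recency weights.
    """
    history = np.asarray(history, dtype=int)
    if history.size < 2:
        raise ValueError("need at least two observations")

    # Assumed non-linear weights: recent transitions count exponentially more.
    w_trans = np.exp(alpha * np.arange(history.size - 1))

    # Weighted transition counts between consecutive time states.
    counts = np.zeros((n_speakers, n_speakers))
    np.add.at(counts, (history[:-1], history[1:]), w_trans)

    # Row-normalise; speakers never seen as a source state get a uniform row.
    row_sums = counts.sum(axis=1, keepdims=True)
    safe_sums = np.where(row_sums > 0, row_sums, 1.0)
    P = np.where(row_sums > 0, counts / safe_sums, 1.0 / n_speakers)

    # Weighted initial state vector built from the same window.
    w_state = np.exp(alpha * np.arange(history.size))
    pi = np.bincount(history, weights=w_state, minlength=n_speakers)
    pi /= pi.sum()

    # One-step prediction: distribution over speakers at the next time state.
    next_dist = pi @ P
    return int(np.argmax(next_dist)), next_dist


speaker, dist = predict_next_speaker([0, 0, 0, 1, 1, 1, 1], n_speakers=3)
```

In this toy window, speaker 1 has held the floor most recently, so the weighted chain favours speaker 1 for the next time state. A larger `alpha` makes the predictor react faster to a change of speaker, at the cost of being more sensitive to short transient bursts.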
Publisher
Database: Elsevier - ScienceDirect
Journal: Information Sciences - Volumes 463–464, October 2018, Pages 344-362