کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
413310 680399 2010 12 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Speaker localization and tracking with a microphone array on a mobile robot using von Mises distribution and particle filtering
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
پیش نمایش صفحه اول مقاله
Speaker localization and tracking with a microphone array on a mobile robot using von Mises distribution and particle filtering
چکیده انگلیسی

This paper deals with the problem of localizing and tracking a moving speaker over the full range around the mobile robot. The problem is solved by taking advantage of the phase shift between signals received at spatially separated microphones. The proposed algorithm is based on estimating the time difference of arrival by maximizing the weighted cross-correlation function in order to determine the azimuth angle of the detected speaker. The cross-correlation is enhanced with an adaptive signal-to-noise estimation algorithm to make the azimuth estimation more robust in noisy surroundings. A post-processing technique is proposed in which each of these microphone-pair determined azimuths are further combined into a mixture of von Mises distributions, thus producing a practical probabilistic representation of the microphone array measurement. It is shown that this distribution is inherently multimodal and that the system at hand is non-linear. Therefore, particle filtering is applied for discrete representation of the distribution function. Furthermore, the two most common microphone array geometries are analysed and exhaustive experiments were conducted in order to qualitatively and quantitatively test the algorithm and compare the two geometries. Also, a voice activity detection algorithm based on the before-mentioned signal-to-noise estimator was implemented and incorporated into the existing speaker localization system. The results show that the algorithm can reliably and accurately localize and track a moving speaker.

Research highlights
► a voice activity detector is integrated into speaker localization framework
► square array shows smaller error sensitivity than the Y array
► speaker azimuth is calculated in a robust and computationally undemanding manner
► number of cross-correlation evaluations is equal to the number of microphone pairs
► the algorithm performance is verified with an accurate laser leg-tracking algorithm

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Robotics and Autonomous Systems - Volume 58, Issue 11, 30 November 2010, Pages 1185–1196
نویسندگان
, ,