کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
559001 875029 2015 25 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Robust speaker localization for real-world robots
ترجمه فارسی عنوان
محلی سازی بلندگوهای قوی برای روبات های دنیای واقعی
کلمات کلیدی
آرایه میکروفون؛محلی سازی آکوستیک؛برآورد تاخیر زمانی؛قدرت پاسخ هدایت شده؛افزایش طیف فاز
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال
چکیده انگلیسی


• We look at the acoustic localization for real-world robots and conditions.
• We investigate the role of the microphones characteristics in the acoustic localization.
• We propose a time delay estimation smoothing which is based on the signal properties.
• We formulate a framework for addressing the robot's shape and surface material influence in the acoustic localization.
• We propose a pre-processing approach for enhancing the acoustic localization performance.

Autonomous human–robot interaction ultimately requires an artificial audition module that allows the robot to process and interpret a combination of verbal and non-verbal auditory inputs. A key component of such a module is the acoustic localization. The acoustic localization not only enables the robot to simultaneously localize multiple persons and auditory events of interest in the environment, but also provides input to auditory tasks such as speech enhancement and speech recognition. The use of microphone arrays in robots is an efficient and commonly applied approach to the localization problem. In this paper, moving away from simulated environments, we look at the acoustic localization under real-world conditions and limitations. Our approach proposes a series of enhancements, taking into account the imperfect frequency response of the array microphones and addressing the influence of the robot's shape and surface material. Motivated by the importance of the signal's phase information, we introduce a novel pre-processing step for enhancing the acoustic localization. Results show that the proposed approach improves the localization performance in joint noisy and reverberant conditions and allows a humanoid robot to locate multiple speakers in a real-world environment.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computer Speech & Language - Volume 34, Issue 1, November 2015, Pages 129–153
نویسندگان
, , ,