کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
564578 1451744 2015 10 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Binaural source separation based on spatial cues and maximum likelihood model adaptation
ترجمه فارسی عنوان
جداسازی منابع دوقطبی بر اساس نشانه های فضایی و سازگاری مدل حداکثر احتمال
کلمات کلیدی
جداسازی منابع دوطرفه، سازگاری مدل، حداکثر رگرسیون خطی احتمال، پردازش سیگنال آماری، تقویت گفتار
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال
چکیده انگلیسی


• We proposed a two-microphone model-based algorithm for separation of moving sound sources.
• We utilize a spatial-model of sources, and separate source signals accordingly.
• We employ an expectation-maximization algorithm to initialize the model parameters.
• We derive a maximum-likelihood-linear-regression algorithm to adapt the model parameters according to new source locations.

This paper describes a system for separating multiple moving sound sources from two-channel recordings based on spatial cues and a model adaptation technique. We employ a statistical model of observed interaural level and phase differences, where maximum likelihood estimation of model parameters is achieved through an expectation-maximization algorithm. This model is used to partition spectrogram points into several clusters (one cluster per source) and generate spectrogram masks accordingly for isolating individual sound sources. We follow a maximum likelihood linear regression (MLLR) approach for tracking source relocations and adapting model parameters accordingly. The proposed algorithm is able to separate more sources than input channels, i.e. in the underdetermined setting. In simulated anechoic and reverberant environments with two and three speakers, the proposed model-adaptation algorithm yields more than 10 dB gain in signal-to-noise-ratio-improvement for azimuthal source relocations of 15° or more. Moreover, this performance gain is achievable with only 0.6 seconds of input mixture received after relocation.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Digital Signal Processing - Volume 36, January 2015, Pages 174–183
نویسندگان
, , , ,