کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4977820 1452011 2017 16 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Underdetermined blind separation of overlapped speech mixtures in time-frequency domain with estimated number of sources
ترجمه فارسی عنوان
تعریف کورکورانه از مخلوط گفتار همپوشانی در دامنه فرکانس زمانی با تعداد برآورد منابع
کلمات کلیدی
تقسیم بر اساس منبع نابینا، حذف سر و صدا، برآورد تعداد منابع، ارزیابی ماتریس مخلوط، تبدیل کوتاه مدت فوریه،
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال
چکیده انگلیسی

Noise suppression and the estimation of the number of sources are two practical issues in applications of underdetermined blind source separation (UBSS). This paper proposes a noise-robust instantaneous UBSS algorithm for highly overlapped speech sources in the short-time Fourier transform (STFT) domain. The proposed algorithm firstly estimates the unknown complex-valued mixing matrix and the number of sources, which are then used to compute the STFT coefficients of corresponding sources at each auto-source time-frequency (TF) point. After that, the original sources are recovered by the inverse STFT. To mitigate the noise effect on the detection of auto-source TF points, we propose a method to effectively detect the auto-term location of the sources by using the principal component analysis (PCA) of the STFTs of noisy mixtures. The PCA-based detection method can achieve similar UBSS outcome as some filtering-based methods. More importantly, an efficient method to estimate the mixing matrix is proposed based on subspace projection and clustering approaches. The number of sources is obtained by counting the number of the resultant clusters. Evaluations have been carried out by using the speech corpus NOIZEUS and the experimental results have shown improved robustness and efficiency of the proposed algorithm.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Speech Communication - Volume 89, May 2017, Pages 1-16
نویسندگان
, , , , ,