دانلود رایگان مقاله: روش های محاسباتی برای محلی سازی و جداسازی گفتار پیچیده و با استفاده از تجزیه و تحلیل مؤلفه های کمی مبتنی بر مدل

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
566009	1452024	2016	17 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Computational methods for underdetermined convolutive speech localization and separation via model-based sparse component analysis

ترجمه فارسی عنوان

روش های محاسباتی برای محلی سازی و جداسازی گفتار پیچیده و با استفاده از تجزیه و تحلیل مؤلفه های کمی مبتنی بر مدل

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Reverberation - بازتاب Sparse component analysis - تجزیه و تحلیل جزء Sparse Computational auditory scene analysis - تجزیه و تحلیل صحنه شنوایی محاسباتی Structured sparse representation - نمایندگی نادرست ساختاری

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال

پیش نمایش مقاله

روش های محاسباتی برای محلی سازی و جداسازی گفتار پیچیده و با استفاده از تجزیه و تحلیل مؤلفه های کمی مبتنی بر مدل

چکیده انگلیسی

• Model-based sparse component analysis exploits structured sparsity for source separation.
• Spectral sparsity structures are formulated upon the principles of auditory scene analysis.
• Spatial sparsity structures are formulated upon the image model of multipath propagation.
• Performance of greedy, convex and Bayesian sparse recovery are evaluated.
• Ad hoc microphone arrays may lead to significant improvement in SCA performance.

In this paper, the problem of speech source localization and separation from recordings of convolutive underdetermined mixtures is addressed. This problem is cast as recovering the spatio-spectral speech information embedded in a microphone array compressed measurements of the acoustic field. A model-based sparse component analysis framework is formulated for sparse reconstruction of the speech spectra in a reverberant acoustic resulting in joint localization and separation of the individual sources. We compare and contrast the algorithmic approaches to model-based sparse recovery exploiting spatial sparsity as well as spectral structures underlying spectrographic representation of speech signals. In this context, we explore identification of the sparsity structures at the auditory and acoustic representation spaces. The audiory structures are formulated upon the principles of structural grouping based on proximity, autoregressive correlation and harmonicity of the spectral coefficients and they are incoporated for sparse reconstruction. The acoustic structures are formulated upon the image model of multipath propagation and they are exploited to characterize the compressive measurement matrix associated with microphone array recordings.Three approaches to sparse recovery relying on combinatorial optimization, convex relaxation and sparse Bayesian learning are studied and evaluated on thorough experiments. The sparse Bayesian learning method is shown to yield better perception quality while the interference suppression is also achieved using the combinatorial approach with the advantage of offering the most efficient computational cost. Furthermore, it is demonstrated that an average autoregressive model can be learned for speech localization while exploiting the proximity structure in the form of block sparse coefficients enables accurate localization and high quality speech separation. Throughout the extensive empirical evaluation, we confirm that a large and random placement of the microphones enables significant improvement in source localization and separation performance.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Speech Communication - Volume 76, February 2016, Pages 201–217

نویسندگان

Afsaneh Asaei, Hervé Bourlard, Mohammad J. Taghizadeh, Volkan Cevher,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : روش های محاسباتی برای محلی سازی و جداسازی گفتار پیچیده و با استفاده از تجزیه و تحلیل مؤلفه های کمی مبتنی بر مدل

دسترسی سریع

ارتباط

English Website