کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4977786 1452008 2017 10 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A permutation algorithm based on dynamic time warping in speech frequency-domain blind source separation
ترجمه فارسی عنوان
الگوریتم جایگزینی براساس زمان بندی پویا در جدایی منبع کورس فرکانس گفتار
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال
چکیده انگلیسی
Frequency-Domain Blind Source Separation (FD-BSS) is an efficient way to analyze convolutive mixed speech. To improve the quality of the separated speech, a permutation algorithm based on Dynamic Time Warping (DTW) is proposed in this paper. Because signals in adjacent frequency bins have high similarity, DTW technology is used to compare them and generate adjustment matrices to solve the permutation ambiguity. Our approach is evaluated through simulated and practical experiments. Using Signal to Distortion Ratio (SDR), Signal to Interference Ratio (SIR), Signal to Artifacts Ratio (SAR), and Perceptual Estimation of the Speech Quality (PESQ) for measurements. To examine the quality of the separated speech in a practical acoustic environment, we adopt the accuracy ratio of Automatic Speech Recognition (ASR). In the experiments, we compare our approach with other classical permutation criteria such as K-L divergence distance, envelope correlation and higher-order statistics. The experimental results show that the proposed algorithm performs permutation alignment more accurately and improves the acoustic quality of separation.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Speech Communication - Volume 92, September 2017, Pages 132-141
نویسندگان
, , , , ,