دانلود رایگان مقاله: افزایش گفتار توسط سازگاری سر و صدا از مقیاس ادراکی و آستانه ضرایب تبدیل موجک پیوسته

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
568553	1452030	2015	12 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Speech enhancement by noise driven adaptation of perceptual scales and thresholds of continuous wavelet transform coefficients

ترجمه فارسی عنوان

افزایش گفتار توسط سازگاری سر و صدا از مقیاس ادراکی و آستانه ضرایب تبدیل موجک پیوسته

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

تقویت گفتار، تبدیل موجک مداوم، آستانه سازگاری مقیاس سازگاری تبدیل موجک بیونیک

Adaptive thresholding - آستانه سازگاری CWT, continuous wavelet transform - تبدیل موجک پیوسته speech enhancement - تقویت گفتار

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال

پیش نمایش مقاله

افزایش گفتار توسط سازگاری سر و صدا از مقیاس ادراکی و آستانه ضرایب تبدیل موجک پیوسته

چکیده انگلیسی

• Speech enhancement.
• Adaptive thresholding.
• Adaptive scaling.
• Additive White Gaussian Noise (AWGN).
• Real world noises: Pink, Babble, Car interior, F16 cockpit noise.

This paper focuses on employing adaptive scales for computation of perceptually scaled continuous wavelet transform coefficients (CWT) and adaptive thresholding of these coefficients for speech enhancement. The adaptive scales and thresholds both were decided on the basis of the noise level of the noisy speech signal. The CWT coefficients were scaled perceptually and the proposed algorithm suggests selection of number of scales required for analysis on the basis of noise level. The CWT coefficients were then thresholded and for this a novel method of generating adaptive thresholds that too depends on the noise level of the noisy signal has also been proposed. Speech signals were acquired from the TIMIT database and evaluation of the proposed method is done by corrupting these signals by white Gaussian noise (at −10, −5, 0, 5, 10, 15 and 20 dB SNRs) and four real world noises (each at 0 dB SNR); pink, babble, car interior and F16 cockpit noise from the NOISEX-92 database. Enhancement results are compared on the basis of signal to noise ratio (SNR), segmental SNR (SSNR), spectral distortion (SD) and perceptual evaluation of speech quality (PESQ).Results of the proposed method are evaluated against Ephraim Malah filtering, Stein’s unbiased risk estimate (SURE) thresholding of bionic wavelet transform (BWT) coefficients (BWT-SURE), Wiener filtering (WF), perceptually scaled wavelet packet transform (PWT), multi-model WF and multi-model sparse code shrinkage (MultiSCS) enhancement methods. For the white Gaussian noise case, at all noise levels, SNR and SSNR of the proposed method were better than all the methods under comparison. SD and PESQ results were lower than multiSCS method at 10 dB SNR but better at 15 dB and 20 dB SNRs. For the babble noise case, the obtained results were lower than Ephraim Malah but better than BWT-SURE. SNR and SSNR results for the cockpit noise were comparable with Ephraim Malah and BWT-SURE while for the pink noise case, the proposed method gives the best results.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Speech Communication - Volume 70, June 2015, Pages 1–12

نویسندگان

Preety D. Swami, Rupali Sharma, Alok Jain, Dhirendra K. Swami,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : افزایش گفتار توسط سازگاری سر و صدا از مقیاس ادراکی و آستانه ضرایب تبدیل موجک پیوسته

دسترسی سریع

ارتباط

English Website