Article code: 558280
Journal code: 874889
Publication year: 2014
English article: 14 pages, PDF
English title of the ISI article
Speaker adaptive voice source modeling with applications to speech coding and processing
Related topics
Engineering and Basic Sciences, Computer Engineering, Signal Processing
Abstract


• We model the speech signal by a speaker-adapted glottal source and a vocal tract filter.
• The source model used is a physically based dynamical model representing vocal fold oscillation.
• The model is fitted to different speaker voice samples.
• Transformations are performed effectively through control of the glottal model.
• Experimental evidence of the effectiveness of the model is provided through objective and subjective assessment.

We discuss the use of low-dimensional physical models of the voice source for speech coding and processing applications. A class of waveform-adaptive dynamic glottal models and parameter identification procedures are illustrated. The model and the identification procedures are assessed by addressing signal transformations on recorded speech, achievable by fitting the model to the data, and then acting on the physically oriented parameters of the voice source. The class of models proposed provides, in principle, a tool for both estimating glottal source signals and encoding the speech signal for transformation purposes. The application of this model to time stretching and to fundamental frequency control (pitch shifting) is also illustrated. The experiments show that copy synthesis is perceptually very similar to the target, and that time stretching and “pitch extrapolation” effects can be obtained by simple control strategies.

Publisher
Database: Elsevier - ScienceDirect
Journal: Computer Speech & Language - Volume 28, Issue 5, September 2014, Pages 1195–1208