کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
6961058 1452028 2015 11 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Statistical parametric speech synthesis using a hidden trajectory model
ترجمه فارسی عنوان
سنتز گفتاری پارامتریک گفتاری با استفاده از یک مدل مسیر پنهان
کلمات کلیدی
سنتز گفتار، مدل مخفی مارکف، مدل مسیر پنهان، تولید سخنرانی،
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال
چکیده انگلیسی
A novel spectral modeling method for statistical parametric speech synthesis using a hidden trajectory model (HTM) is presented in this paper. An HTM is a structured generative model with a two-stage implementation. First hidden formant trajectories are generated from time-aligned formant target sequences by a bidirectional filter. This target-filtering model could provide a correlation structure across temporal frames and describe the effect of co-articulation on speech signals efficiently. Then the observed cepstral features are constituted by a formant-related component and a residual component. The formant-related component is predicted from hidden formant trajectories using a nonlinear and analytical function, and the prediction residuals are modeled by context-dependent Gaussians. In this paper, we apply HTM-based acoustic modeling to speech synthesis and investigate the effectiveness of this method in improving the naturalness and controllability of synthetic speech. Experimental results show that this proposed method can improve the accuracy of spectral feature prediction and the naturalness of synthetic speech compared with the conventional HMM-based method, especially for the conditions where the amount of training data is limited. Furthermore, this method can achieve effective controllability on vowel quality and formant sharpness of synthetic speech by conveniently manipulating the distribution parameters for the phone-dependent targets of formant frequencies and bandwidths.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Speech Communication - Volume 72, September 2015, Pages 149-159
نویسندگان
, , ,