A consistency analysis on an acoustic module for Mandarin text-to-speech

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
567277	876066	2013	12 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Vector Quantization (VQ)Consistency analysis - تجزیه و تحلیل انطباق Speech synthesis - سنتز گفتار Hidden Markov Model (HMM) - مدل مارکف مخفی (HMM)

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال

پیش نمایش صفحه اول مقاله

A consistency analysis on an acoustic module for Mandarin text-to-speech

چکیده انگلیسی

In this work, a consistency analysis on an acoustic module for a Mandarin text-to-speech (TTS) is presented as a way to improve the speech quality. Found by an inspection on the pronunciation process of human beings, the consistency can be interpreted as a high correlation of a warping curve between the spectrum and the prosody intra a syllable. Through three steps in the procedure of the consistency analysis, the HMM algorithm is used firstly to decode HMM-state sequences within a syllable at the same time as to divide them into three segments. Secondly, based on a designated syllable, the vector quantization (VQ) with the Linde–Buzo–Gray (LBG) algorithm is used to train the VQ codebooks of each segment. Thirdly, the prosodic vector of each segment is encoded as an index by VQ codebooks, and then the probability of each possible path is evaluated as a prerequisite to analyze the consistency. It is demonstrated experimentally that a consistency is definitely acquired in case the syllable is located exactly in the same word. These results offer a research direction that the warping process between the spectrum and the prosody intra a syllable must be considered in a TTS system to improve the speech quality.

► The consistency between the spectrum and the prosody intra a syllable is proposed.
► We model and experiment on the consistency analysis concerning prosodic units.
► The consistency is verified while the same syllables located in the same word.
► The same syllables located in different words bring distinct consistency.
► Analytic results offer to improve the speech quality of Mandarin text-to-speech.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Speech Communication - Volume 55, Issue 2, February 2013, Pages 266–277

نویسندگان

Cheng-Yu Yeh, Shun-Chieh Chang, Shaw-Hwa Hwang,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

A consistency analysis on an acoustic module for Mandarin text-to-speech

دسترسی سریع

ارتباط

English Website