کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
566182 875949 2009 17 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Speaker-independent phoneme alignment using transition-dependent states
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال
پیش نمایش صفحه اول مقاله
Speaker-independent phoneme alignment using transition-dependent states
چکیده انگلیسی

Determining the location of phonemes is important to a number of speech applications, including training of automatic speech recognition systems, building text-to-speech systems, and research on human speech processing. Agreement of humans on the location of phonemes is, on average, 93.78% within 20 ms on a variety of corpora, and 93.49% within 20 ms on the TIMIT corpus. We describe a baseline forced-alignment system and a proposed system with several modifications to this baseline. Modifications include the addition of energy-based features to the standard cepstral feature set, the use of probabilities of a state transition given an observation, and the computation of probabilities of distinctive phonetic features instead of phoneme-level probabilities. Performance of the baseline system on the test partition of the TIMIT corpus is 91.48% within 20 ms, and performance of the proposed system on this corpus is 93.36% within 20 ms. The results of the proposed system are a 22% relative reduction in error over the baseline system, and a 14% reduction in error over results from a non-HMM alignment system. This result of 93.36% agreement is the best known reported result on the TIMIT corpus.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Speech Communication - Volume 51, Issue 4, April 2009, Pages 352–368
نویسندگان
,