کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
566050 875914 2012 9 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Assessment of disordered voice via the first rahmonic
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال
پیش نمایش صفحه اول مقاله
Assessment of disordered voice via the first rahmonic
چکیده انگلیسی

A number of studies have shown that the amplitude of the first rahmonic peak (R1) in the cepstrum can be usefully employed to indicate hoarse voice quality. The cepstrum is obtained by taking the inverse Fourier transform of the log-magnitude spectrum. In the present study, a number of spectral pre-processing steps are investigated prior to computing the cepstrum; the pre-processing steps include period-synchronous, period-asynchronous, harmonic-synchronous and harmonic-asynchronous spectral band-limitation analysis. The analysis is applied on both sustained vowels [a] and connected speech signals. The correlation between R1 (the amplitude of the first rahmonic) and perceptual ratings is examined for a corpus comprising 251 speakers. It is observed that the correlation between R1 and perceptual ratings increases when the spectrum is band-limited prior to computing the cepstrum. In addition, comparisons are made with a previously reported cepstral cue, cepstral peak prominence (CPP).


► The amplitude of the first rahmonic peak obtained for connected speech and sustained vowels.
► The amplitude of the first rahmonic peak correlates with perceived hoarseness.
► Period-synchronous and harmonic-limited analyses increase correlation.
► Comparisons between the amplitude of the first rahmonic peak and cepstral peak prominence.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Speech Communication - Volume 54, Issue 5, June 2012, Pages 655–663
نویسندگان
, , , , ,