کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
566023 1452025 2015 23 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Analysis of acoustic space variability in speech affected by depression
ترجمه فارسی عنوان
تجزیه و تحلیل تغییرات فضای صوتی در گفتار تحت تاثیر افسردگی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال
چکیده انگلیسی


• Present novel probabilistic acoustic volume, a robust acoustic variability measure.
• As depression increases phonetic events become concentrated in acoustic space.
• MFCC feature space becomes tightly concentrated with increasing depression.
• Speech trajectory in acoustic space becomes smoother with increasing depression.
• Choice of speech collection paradigm may adversely affect depression detection.

The spectral and energy properties of speech have consistently been observed to change with a speaker’s level of clinical depression. This has resulted in spectral and energy based features being a key component in many speech-based classification and prediction systems. However there has been no in-depth investigation into understanding how acoustic models of spectral features are affected by depression. This paper investigates the hypothesis that the effects of depression in speech manifest as a reduction in the spread of phonetic events in acoustic space as modelled by Gaussian Mixture Models (GMM) in combination with Mel Frequency Cepstral Coefficients (MFCC). Our investigation uses three measures of acoustic variability: Average Weighted Variance (AWV), Acoustic Movement (AM) and Acoustic Volume, which attempt to model depression specific acoustic variations (AWV and Acoustic Volume), or the trajectory of a speech in the acoustic space (AM). Within our analysis we present the Probabilistic Acoustic Volume (PAV) a novel method for robustly estimating Acoustic Volume using a Monte Carlo sampling of the feature distribution being modelled. We show that using an array of PAV points we gain insights into how the concentration of the feature vectors in the feature space changes with depression. Key results – found on two commonly used depression corpora – consistently indicate that as a speaker’s level of depression increases there are statistically significantly reductions in both AWV (−0.44 ⩽ rs ⩽ −0.18 with p < .05) and AM (−0.26 ⩽ rs ⩽ −0.19 with p < .05) values, indicating a decrease in localised acoustic variance and smoothing in acoustic trajectory respectively. Further there are also statistically significant reductions (−0.32 ⩽ rs ⩽ −0.20 with p < .05) in Acoustic Volume measures and strong statistical evidence (−0.48 ⩽ rs ⩽ −0.23 with p < .05) that the MFCC feature space becomes more concentrated. Quantifying these effects is expected to be a key step towards building an objective classification or prediction system which is robust to many of the unwanted – in terms of depression analysis – sources of variability modulated into a speech signal.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Speech Communication - Volume 75, December 2015, Pages 27–49
نویسندگان
, , , , ,