Article ID Journal Published Year Pages File Type
558331 Computer Speech & Language 2013 20 Pages PDF
Abstract

This paper describes a new algorithm for automatically detecting creak in speech signals. Detection is made by utilising two new acoustic parameters which are designed to characterise creaky excitations following previous evidence in the literature combined with new insights from observations in the current work. In particular the new method focuses on features in the Linear Prediction (LP) residual signal including the presence of secondary peaks as well as prominent impulse-like excitation peaks. These parameters are used as input features to a decision tree classifier for identifying creaky regions. The algorithm was evaluated on a range of read and conversational speech databases and was shown to clearly outperform the state-of-the-art. Further experiments involving degradations of the speech signal demonstrated robustness to both white and babble noise, providing better results than the state-of-the-art down to at least 20 dB signal to noise ratio.

► Review of state-of-the-art creak detection algorithms. ► Examination of the excitation characteristics of creak. ► Proposal of two new acoustic parameters to be used for automatically identifying creaky regions in speech signals. ► Evaluation on a range of read and conversational databases containing a variety of speakers, gender and languages. ► Robustness testing involving simulated degradation of speech signals by adding white noise and ‘babble’ noise at varying signal-to-noise ratios.

Related Topics
Physical Sciences and Engineering Computer Science Signal Processing
Authors
, , ,