Article ID Journal Published Year Pages File Type
534955 Pattern Recognition Letters 2009 6 Pages PDF
Abstract

Gaussian mixture model is the conventional approach employed in speaker recognition tasks. Although it is efficient to model specific speaking characteristics of a speaker, especially in quiet environments, its performance in noisy conditions is still far from the human cognitive process. Recently, a new method of αα-integration of stochastic models has been proposed based on psychophysical experiments that suggests αα-integration is used in a human brain. In this paper, we proposed a method to extend the conventional GMM to the αα-integrated GMM (αα-GMM) to model personal speaking traits. Model parameters were re-estimated recursively based on a given data set. The experiments showed that the new approach significantly outperforms the traditional method, especially on telephony speech.

Related Topics
Physical Sciences and Engineering Computer Science Computer Vision and Pattern Recognition
Authors
, , ,