Article ID Journal Published Year Pages File Type
535465 Pattern Recognition Letters 2006 8 Pages PDF
Abstract

It is well known that when there is an acoustic mismatch between the speech obtained during training and testing the accuracy of speaker recognition systems drastically deteriorates. In this paper we propose Modified Segmental Histogram Equalization to improve the robustness of a speaker verification system operating in telephone environments. The technique transforms the features extracted from short adjacent segments of speech within an utterance such that their statistics conform to that of a Gaussian distribution with zero mean and unity variance across all recording conditions. In doing so, the feature statistics become less environment-dependent. Experiments performed on the NIST 2000 database show significant improvements in performance.

Related Topics
Physical Sciences and Engineering Computer Science Computer Vision and Pattern Recognition
Authors
, ,