Article ID Journal Published Year Pages File Type
562370 Signal Processing 2015 12 Pages PDF
Abstract

•An efficient online preprocessing method is proposed for robust speech recognition.•The method extracts target speech with target speaker direction as a prior knowledge.•A parameter adaptation rule was derived by modifying independent component analysis.•The method achieves robustness by adapting parameters with fixed noise estimation.•On average, the method showed better recognition performance with fewer computations.

This paper describes an efficient online target-speech-extraction method used as a preprocessing step for robust automatic speech recognition (ASR). Because a target speaker is located relatively close to microphones in many ASR applications, acoustic paths to microphones are moderately reverberant, and the target speaker direction can easily be estimated. In this situation, noise estimation is effectively performed by forming a directional null to the target speaker. Required weights for extracting target speech, independent of the estimated noise, are then determined using an adaptation rule derived from a modified version of the cost function for independent component analysis (ICA), while retaining the minimal distortion principle. In particular, an online natural-gradient learning rule with a nonholonomic constraint and normalization by a smoothed power estimate of the input signal is derived for stable convergence, even for dynamically changing speech levels, with much less computational complexity than conventional ICA. Furthermore, stereo mixtures are considered as input data for further reduction of computational loads and fast convergence. Although the method may suffer from the underdetermined problem, the weights are adapted to obtain signal-to-noise-ratio-maximization beamformers for successful target speech estimation. The experimental results obtained for various conditions demonstrate the effectiveness of the proposed method.

Related Topics
Physical Sciences and Engineering Computer Science Signal Processing
Authors
, ,