Two-stage cascaded classification approach based on genetic fuzzy learning for speech/music discrimination

Article ID	Journal	Published Year	Pages	File Type
381109	Engineering Applications of Artificial Intelligence	2010	9 Pages	PDF

Abstract

Automatic discrimination of speech and music is an important tool in many multimedia applications. The paper presents a robust and effective approach for speech/music discrimination, which relies on a two-stage cascaded classification scheme. The cascaded classification scheme is composed of a statistical pattern recognition classifier followed by a genetic fuzzy system. For the first stage of the classification scheme, other widely used classifiers, such as neural networks and support vector machines, have also been considered in order to assess the robustness of the proposed classification scheme. Comparison with well-proven signal features is also performed. In this work, the most commonly used genetic learning algorithms (Michigan and Pittsburgh) have been evaluated in the proposed two-stage classification scheme. The genetic fuzzy system gives rise to an improvement of about 4% in the classification accuracy rate. Experimental results show the good performance of the proposed approach with a classification accuracy rate of about 97% for the best trial.

Keywords

Speech/music discrimination Statistical pattern recognition Classifier Evolutionary computation Multimedia