Rhythmic unit extraction and modelling for automatic language identification

Article ID	Journal	Published Year	Pages	File Type
10370832	Speech Communication	2005	21 Pages	PDF

Abstract

This paper deals with an approach to automatic language identification based on rhythmic modelling. Beside phonetics and phonotactics, rhythm is actually one of the most promising features to be considered for language identification, even if its extraction and modelling are not a straightforward issue. Actually, one of the main problems to address is what to model. In this paper, an algorithm of rhythm extraction is described: using a vowel detection algorithm, rhythmic units related to syllables are segmented. Several parameters are extracted (consonantal and vowel duration, cluster complexity) and modelled with a Gaussian Mixture. Experiments are performed on read speech for seven languages (English, French, German, Italian, Japanese, Mandarin and Spanish) and results reach up to 86Â Â±Â 6% of correct discrimination between stress-timed mora-timed and syllable-timed classes of languages, and to 67Â Â±Â 8% of correct language identification on average for the seven languages with utterances of 21Â s. These results are commented and compared with those obtained with a standard acoustic Gaussian mixture modelling approach (88Â Â±Â 5% of correct identification for the seven languages identification task).

Keywords

Language identification