The Effect of Tone Modeling in Vietnamese LVCSR System

Article ID	Journal	Published Year	Pages	File Type
485452	Procedia Computer Science	2016	8 Pages	PDF

Abstract

In this work, the tone modeling approaches are used manifest the tonal structure of Vietnamese and tonal feature is also used to build acoustic models. The results on LVCSR using deep bottleneck features (DBNFs) and different types of pronouncing dictionary, are also presented. The experiments are carried out on the dataset containing speeches on Voice of Vietnam channel (VOV). The results show that the performance of the system using tonal phoneme obtained relative improvements over the best non-tonal phoneme system by 19.25%. The DBNFs systems are applicable on tonal dictionary and adding tonal feature as input feature of the network reached around 18% relative recognition performance.