Incremental word learning: Efficient HMM initialization and large margin discriminative adaptation

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
567437	876078	2012	20 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Discriminative training - آموزش تبعیض آمیز multiple sequence alignment - ترتیب توالی چندگانه Speech recognition - تشخیص گفتار Bootstrapping - راه ‌اندازی خودکار، بوت‌ استرپینگ

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال

پیش نمایش صفحه اول مقاله

Incremental word learning: Efficient HMM initialization and large margin discriminative adaptation

چکیده انگلیسی

In this paper we present an incremental word learning system that is able to cope with few training data samples to enable speech acquisition in on-line human robot interaction. As with most automatic speech recognition systems (ASR), our architecture relies on a Hidden Markov Model (HMM) framework where the different word models are sequentially trained and the system has little prior knowledge. To achieve good performance, HMMs depends on the amount of training data, the initialization procedure and the efficiency of the discriminative training algorithms. Thus, we propose different approaches to improve the system. One major problem of using a small amount of training data is over-fitting. Hence we present a novel estimation of the variance floor dependent on the number of available training samples. Next, we propose a bootstrapping approach in order to get a good initialization of the HMM parameters. This method is based on unsupervised training of the parameters and subsequent construction of a new HMM by aligning and merging Viterbi decoded sequences. Finally, we investigate large margin discriminative training techniques to enlarge the generalization performance of the models using several strategies suitable for limited training data. In the evaluation of the results, we examine the contribution of the different stages proposed to the overall system performance. This includes the comparison of different state-of-the-art methods with our presented techniques and the investigation of the possible reduction of the number of training data samples. We compare our algorithms on isolated and continuous digit recognition tasks. To sum up, we show that the proposed algorithms yield significant improvements and are a step towards efficient learning with few examples.

► Incremental word learning system to cope with few training samples for human-robot interaction.
► Hidden Markov Model (HMM) framework with little prior knowledge.
► A novel estimation of a variance floor dependent on the number of available samples.
► An initialization based on unsupervised training plus aligning and merging Viterbi sequences.
► Large margin discriminative training strategies suitable for limited training data.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Speech Communication - Volume 54, Issue 9, November 2012, Pages 1029–1048

نویسندگان

Irene Ayllón Clemente, Martin Heckmann, Britta Wrede,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Incremental word learning: Efficient HMM initialization and large margin discriminative adaptation

دسترسی سریع

ارتباط

English Website