Multi-accent acoustic modelling of South African English

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
565953	875876	2012	13 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Under-resourced languages - زبان های زیرزمینی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال

پیش نمایش صفحه اول مقاله

Multi-accent acoustic modelling of South African English

چکیده انگلیسی

Although English is spoken throughout South Africa it is most often used as a second or third language, resulting in several prevalent accents within the same population. When dealing with multiple accents in this under-resourced environment, automatic speech recognition (ASR) is complicated by the need to compile multiple, accent-specific speech corpora. We investigate how best to combine speech data from five South African accents of English in order to improve overall speech recognition performance. Three acoustic modelling approaches are considered: separate accent-specific models, accent-independent models obtained by pooling training data across accents, and multi-accent models. The latter approach extends the decision-tree clustering process normally used to construct tied-state hidden Markov models (HMMs) by allowing questions relating to accent. We find that multi-accent modelling outperforms accent-specific and accent-independent modelling in both phone and word recognition experiments, and that these improvements are statistically significant. Furthermore, we find that the relative merits of the accent-independent and accent-specific approaches depend on the particular accents involved. Multi-accent modelling therefore offers a mechanism by which speech recognition performance can be optimised automatically, and for hard decisions regarding which data to pool and which to separate to be avoided.

► We consider acoustic modelling of accented speech in an under-resourced setting.
► Five South African English accents are pooled, kept separate and selectively shared.
► We find that the relative merits of pooling and data separation are accent-dependent.
► However tied-state triphone HMMs show consistent improvements when data is shared.
► Selective data sharing allows hard decisions regarding data partitioning be avoided.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Speech Communication - Volume 54, Issue 6, July 2012, Pages 801–813

نویسندگان

Herman Kamper, Félicien Jeje Muamba Mukanya, Thomas Niesler,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Multi-accent acoustic modelling of South African English

دسترسی سریع

ارتباط

English Website