Article code | Journal code | Publication year | Article | Full-text version |
---|---|---|---|---|
536009 | 870429 | 2011 | English, 7-page PDF | Free download |
Speech is the most natural form of communication for human beings. However, in situations where audio speech is not available because of disability or adverse environmental conditions, people may resort to alternative methods of augmented speech, that is, audio speech supplemented or replaced by other modalities, such as audiovisual speech or Cued Speech. This article introduces augmented speech communication based on Electro-Magnetic Articulography (EMA). Movements of the tongue, lips, and jaw are tracked by EMA and are used as features to train hidden Markov models (HMMs). In addition, automatic phoneme recognition experiments are conducted to examine the possibility of recognizing speech from articulation alone, that is, without any audio information. The results obtained are promising and confirm that phonetic features characterizing articulation are as discriminating as those characterizing acoustics (except for voicing). This article also describes experiments conducted in noisy environments using fused audio and EMA parameters. When EMA parameters are fused with noisy audio speech, the recognition rate increases significantly compared with using noisy audio speech alone.
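The abstract's recognition setup (one HMM per phoneme, scored on articulatory feature vectors) can be sketched as follows. This is a minimal numpy-only illustration, not the paper's implementation: the 3-state left-to-right topology, the 2-D features, the state means, and the toy observation sequence are all hypothetical. Each phoneme model is scored with the forward algorithm in the log domain, and the phoneme with the highest log-likelihood is chosen.

```python
import numpy as np

LOG0 = -np.inf


def log_gauss(x, means, variances):
    """Log density of x under each state's diagonal Gaussian (one row per state)."""
    return -0.5 * np.sum(np.log(2 * np.pi * variances)
                         + (x - means) ** 2 / variances, axis=-1)


def forward_loglik(obs, log_start, log_trans, means, variances):
    """Forward algorithm in the log domain; returns log P(obs | model)."""
    alpha = log_start + log_gauss(obs[0], means, variances)
    for x in obs[1:]:
        alpha = log_gauss(x, means, variances) + \
                np.logaddexp.reduce(alpha[:, None] + log_trans, axis=0)
    return np.logaddexp.reduce(alpha)


# Two toy phoneme models (hypothetical state means; unit variances),
# sharing a 3-state left-to-right topology that starts in state 0.
log_start = np.array([0.0, LOG0, LOG0])
log_trans = np.log(np.array([[0.5, 0.5, 0.0],
                             [0.0, 0.5, 0.5],
                             [0.0, 0.0, 1.0]]) + 1e-300)
models = {
    "a": np.array([[0.0, 0.0], [1.0, 1.0], [2.0, 2.0]]),
    "i": np.array([[5.0, 5.0], [6.0, 6.0], [7.0, 7.0]]),
}
variances = np.ones((3, 2))

# An EMA-like observation sequence lying near phoneme "a"'s state means.
obs = np.array([[0.1, -0.1], [0.9, 1.1], [1.8, 2.1], [2.1, 1.9]])
scores = {p: forward_loglik(obs, log_start, log_trans, m, variances)
          for p, m in models.items()}
recognized = max(scores, key=scores.get)
```

In a real system the Gaussian parameters would be estimated from EMA recordings (e.g. via Baum-Welch training), and mixture emissions would replace the single Gaussian per state.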
Research highlights
► An Electro-Magnetic Articulography (EMA) device can capture tongue movements accurately.
► Using hidden Markov models, articulatory movements of the lips, jaw, and tongue can be recognized.
► Articulatory features are as discriminating as acoustic ones (except for voicing).
► Tongue movements can be recognized more accurately than lip and jaw movements.
► Robustness against noise increases when fusing articulatory features with audio features.
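The fusion of audio and EMA parameters mentioned above can be illustrated with a simple frame-synchronous concatenation. This is a hedged sketch, not the paper's method: the feature dimensions (13 audio coefficients, 12 EMA coordinates) and the per-stream z-normalization are assumptions, and the EMA stream is linearly interpolated to the audio frame rate before the two streams are stacked.

```python
import numpy as np


def znorm(feats):
    """Per-dimension zero-mean, unit-variance normalization of one stream."""
    return (feats - feats.mean(axis=0)) / (feats.std(axis=0) + 1e-8)


def fuse(audio_feats, ema_feats):
    """Frame-synchronous concatenation of audio and EMA features.

    The EMA stream is linearly interpolated so that it has one frame
    per audio frame before the two normalized streams are stacked.
    """
    n = len(audio_feats)
    idx = np.linspace(0, len(ema_feats) - 1, n)
    lo = np.floor(idx).astype(int)
    hi = np.minimum(lo + 1, len(ema_feats) - 1)
    w = (idx - lo)[:, None]
    ema_resampled = (1 - w) * ema_feats[lo] + w * ema_feats[hi]
    return np.hstack([znorm(audio_feats), znorm(ema_resampled)])


# Hypothetical streams: 100 audio frames of 13 coefficients,
# 50 EMA frames of 12 coordinates (x/y for 6 sensor coils).
rng = np.random.default_rng(0)
audio = rng.normal(size=(100, 13))
ema = rng.normal(size=(50, 12))
fused = fuse(audio, ema)  # shape (100, 25)
```

The fused vectors would then replace the audio-only features when training and decoding the HMMs, which is the configuration the abstract reports as significantly more robust to noise.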
Journal: Pattern Recognition Letters - Volume 32, Issue 8, 1 June 2011, Pages 1119–1125