کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
558273 874889 2014 17 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Language independent search in MediaEval's Spoken Web Search task
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال
پیش نمایش صفحه اول مقاله
Language independent search in MediaEval's Spoken Web Search task
چکیده انگلیسی


• “Spoken Web Search” allows users to access an audio database with voice queries independent of the language used.
• It is a novel application of speech technology to specifically help under-developed communities.
• Low-resource “query by example” techniques are becoming popular again, and are well suited to solving this problem.
• Rival and complementary techniques include multi-lingual LVCSR and articulatory feature based modeling.
• We present a comparative analysis of these techniques as submitted to the MediaEval evaluation series.

In this paper, we describe several approaches to language-independent spoken term detection and compare their performance on a common task, namely “Spoken Web Search”. The goal of this part of the MediaEval initiative is to perform low-resource language-independent audio search using audio as input. The data was taken from “spoken web” material collected over mobile phone connections by IBM India as well as from the LWAZI corpus of African languages. As part of the 2011 and 2012 MediaEval benchmark campaigns, a number of diverse systems were implemented by independent teams, and submitted to the “Spoken Web Search” task. This paper presents the 2011 and 2012 results, and compares the relative merits and weaknesses of approaches developed by participants, providing analysis and directions for future research, in order to improve voice access to spoken information in low resource settings.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computer Speech & Language - Volume 28, Issue 5, September 2014, Pages 1066–1082
نویسندگان
, , , , ,