کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
558350 874908 2013 13 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
The PASCAL CHiME speech separation and recognition challenge
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال
پیش نمایش صفحه اول مقاله
The PASCAL CHiME speech separation and recognition challenge
چکیده انگلیسی

Distant microphone speech recognition systems that operate with human-like robustness remain a distant goal. The key difficulty is that operating in everyday listening conditions entails processing a speech signal that is reverberantly mixed into a noise background composed of multiple competing sound sources. This paper describes a recent speech recognition evaluation that was designed to bring together researchers from multiple communities in order to foster novel approaches to this problem. The task was to identify keywords from sentences reverberantly mixed into audio backgrounds binaurally recorded in a busy domestic environment. The challenge was designed to model the essential difficulties of the multisource environment problem while remaining on a scale that would make it accessible to a wide audience. Compared to previous ASR evaluations a particular novelty of the task is that the utterances to be recognised were provided in a continuous audio background rather than as pre-segmented utterances thus allowing a range of background modelling techniques to be employed. The challenge attracted thirteen submissions. This paper describes the challenge problem, provides an overview of the systems that were entered and provides a comparison alongside both a baseline recognition system and human performance. The paper discusses insights gained from the challenge and lessons learnt for the design of future such evaluations.


► The paper reviews a recent distant microphone speech recognition evaluation that attracted participation from 13 entrants.
► The paper presents a comparative analysis of the recognition systems that were entered.
► Results of the automatic systems are present and compared to human performance. Common features of successful systems are identified.
► The paper concludes with a brief discussion of possible directions for future challenges.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computer Speech & Language - Volume 27, Issue 3, May 2013, Pages 621–633
نویسندگان
, , , , ,