Article code | Journal code | Publication year | English article | Full-text version
559062 | 875043 | 2012 | 15-page PDF | Free download
English title of the ISI article
The design and collection of COSINE, a multi-microphone in situ speech corpus recorded in noisy environments
Related topics
Engineering and basic sciences, Computer engineering, Signal processing
English abstract

We present an overview of the data collection and transcription efforts for the COnversational Speech In Noisy Environments (COSINE) corpus. The corpus is a set of multi-party conversations recorded in real-world environments with background noise. It can be used to train noise-robust speech recognition systems or to develop speech de-noising algorithms. We explain the motivation for creating such a corpus and describe the audio recordings and transcriptions that comprise it. These high-quality recordings were captured in situ on a custom wearable recording system, whose design and construction are also described. Seven channels of audio are captured on separate synchronized channels: a 4-channel far-field microphone array, a close-talking microphone, a monophonic far-field microphone, and a throat microphone. This corpus thus creates many possibilities for speech algorithm research.


► We present the COnversational Speech In Noisy Environments (COSINE) corpus.
► COSINE consists of multi-party conversations recorded in noisy environments.
► The recordings were captured in situ on a custom wearable portable recording system.
► Seven separate heterogeneous synchronized audio channels have been captured.
► The corpus is useful for noise-robust ASR and speech de-noising algorithms.

Publisher
Database: Elsevier - ScienceDirect
Journal: Computer Speech & Language - Volume 26, Issue 1, January 2012, Pages 52–66