کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
568996 876514 2006 15 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Efficient scalable encoding for distributed speech recognition
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال
پیش نمایش صفحه اول مقاله
Efficient scalable encoding for distributed speech recognition
چکیده انگلیسی

The problem of encoding speech features in the context of a distributed speech recognition system is addressed. Specifically, speech features are compressed using scalable encoding techniques to provide a multi-resolution bitstream. The use of this scalable encoding procedure is investigated in conjunction with a multi-pass distributed speech recognition (DSR) system. The multi-pass DSR system aims at progressive refinement in terms of recognition performance, (i.e., as additional bits are transmitted the recognition can be refined to improve the performance) and is shown to provide both bandwidth and complexity (latency) reductions. The proposed encoding schemes are well suited for implementation on light-weight mobile devices where varying ambient conditions and limited computational capabilities pose a severe constraint in achieving good recognition performance. The multi-pass DSR system is capable of adapting to varying network and system constraints by operating at an appropriate trade-off point between transmission rate, recognition performance and complexity to provide desired quality of service (QoS) to the user. The system was tested using two case studies. In the first, a distributed two-stage names recognition task, the scalable encoder operating at a bitrate of 4.6 kb/s achieved the same performance as that achieved using uncompressed features. In the second study, a two stage multi-pass continuous speech recognition task using HUB-4 data, the scalable encoder at a bitrate of 5.7 kb/s achieved the same performance as that achieved with uncompressed features. Reducing the bitrate to 4800 b/s resulted in a 1% relative increase in WER.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Speech Communication - Volume 48, Issue 8, August 2006, Pages 888–902
نویسندگان
, , ,