کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
433036 689217 2013 9 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Parallel design for error-resilient entropy coding algorithm on GPU
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
پیش نمایش صفحه اول مقاله
Parallel design for error-resilient entropy coding algorithm on GPU
چکیده انگلیسی

The error-resilient entropy coding (EREC) algorithm is an effective method for combating error propagation at low cost in many compression methods using variable-length coding (VLC). However, the main drawback of the EREC is its high complexity. In order to overcome this disadvantage, a parallel EREC is implemented on a graphics processing unit (GPU) using the NVIDIA CUDA technology. The original EREC is a finer-grained parallel at each stage which brings additional communication overhead. To achieve high efficiency of parallel EREC, we propose partitioning the EREC (P-EREC) algorithm, which splits variable-length blocks into groups and then every group is coded using the EREC separately. Each GPU thread processes one group so as to make the EREC coarse-grained parallel. In addition, some optimization strategies are discussed in order to obtain higher performance using the GPU. In the case that the variable-length data blocks are divided into 128 groups (256 groups, resp.), experimental results show that the parallel P-EREC achieves 32×32× to 123×123× (54×54× to 350×350×, resp.) speedup over the original C code of EREC compiled with the O2O2 optimization option. Higher speedup can even be obtained with more groups. Compared to the EREC, the P-EREC not only achieves a good speedup performance, but it also slightly improves the resilience of the VLC bit-stream against burst or random errors.


► We attempt to optimize the performance of the EREC by parallel processing.
► A parallel EREC is implemented on a graphics processing unit (GPU).
► We propose partitioning the EREC (P-EREC) algorithm.
► We implemented the parallel P-EREC on GPU and optimized techniques were fully used.
► The parallel P-EREC gains 32×32× to 123×123× speedup compared with the original C code.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Parallel and Distributed Computing - Volume 73, Issue 4, April 2013, Pages 411–419
نویسندگان
, , , ,