Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
725616 | The Journal of China Universities of Posts and Telecommunications | 2015 | 6 Pages |
Abstract
Parallel acceleration of convolution perfectly matched layer (CPML) algorithm suffers from massive division operation which is widely accepted as one of the most expensive operations for the equipment such as graphic processing unit (GPU), field programmable gate array (FPGA) etc. In pursuit of higher efficiency and lower power consumption, this article revisited the CPML theory and proposed a new fast division-free parallel CPML structure. By optimally rearranging the CPML inner iteration process, all the division operators can be eliminated and replaced by recalculating the related field updating coefficients offline. Experiments show that the proposed division-free structure can save more than 50% arithmetic instructions and 25% execution time of the traditional parallel CPML structure without any accuracy loss.
Related Topics
Physical Sciences and Engineering
Engineering
Electrical and Electronic Engineering
Authors
Bai Bing, Niu Zhongqi, Niu Yi, Wei Bing, Zhao Gang,