کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
6885851 1444581 2018 33 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
CaFPGA: An automatic generation model for CNN accelerator
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر شبکه های کامپیوتری و ارتباطات
پیش نمایش صفحه اول مقاله
CaFPGA: An automatic generation model for CNN accelerator
چکیده انگلیسی
Convolutional neural networks (CNNs) are gaining considerable popularity in numerous computer-vision applications. A convolutional architecture for fast feature embedding (Caffe) and other general frameworks has been proposed with the development of CNN. The field-programmable gate array (FPGA) as a classical platform is used to accelerate CNNs because CNNs are computationally complex tasks. However, the implementation of CNN on FPGA platforms is difficult. The present study focuses on exploring the performance-resource design space and proposes an automatic generation model to implement the CNN reconfigurable accelerator on the FPGA platform, which uses Caffe description text as its input file. A design-space exploration model is further proposed. This model includes a layer-folding pipeline structure to balance the bandwidth requirements of convolutional and fully connected layers with incremental exploration algorithms to exploit CNN parallelism. The AlexNet, VGG-S, and VGG-16 networks are implemented. The AlexNet accelerator can achieve 593.5 GOPS, and the VGG-16 accelerator can achieve 638.9 GOPS, which is equivalent or even exceeds that of the state-of-the-art CNN accelerator for VGG-16.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Microprocessors and Microsystems - Volume 60, July 2018, Pages 196-206
نویسندگان
, , , , ,