Article code: 532109
Journal code: 869910
Publication year: 2014
English article: 14-page PDF
English title of the ISI article
Towards adaptive learning with improved convergence of deep belief networks on graphics processing units
Related topics
Engineering and Basic Sciences; Computer Engineering; Computer Vision and Pattern Recognition
Abstract


• Adaptive step size technique that enhances the convergence of RBMs and DBNs.
• GPU parallel implementation of the RBMs and DBNs.
• Extensive experiments involving the training of hundreds of DBNs (MNIST and HHreco datasets).

In this paper we focus on two complementary approaches to significantly decrease the pre-training time of a deep belief network (DBN). First, we propose an adaptive step size technique to enhance the convergence of the contrastive divergence (CD) algorithm, thereby reducing the number of epochs needed to train the restricted Boltzmann machines (RBMs) that support the DBN infrastructure. Second, we present a highly scalable graphics processing unit (GPU) parallel implementation of the CD-k algorithm, which notably boosts the training speed. Additionally, extensive experiments are conducted on the MNIST and HHreco databases. The results suggest that the maximum useful depth of a DBN is related to the number and quality of the training samples. Moreover, it was found that the lower-level layer plays a fundamental role in building successful DBN models. Furthermore, the results contradict the preconceived idea that all the layers should be pre-trained. Finally, it is shown that by incorporating multiple back-propagation (MBP) layers, the DBN's generalization capability is remarkably improved.
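
To make the first idea in the abstract more concrete, the following is a minimal NumPy sketch of CD-k training for a binary RBM with a separate step size per weight. The abstract does not state the adaptation rule, so the sign-agreement rule used here (grow a step size when consecutive gradient estimates agree in sign, shrink it when they disagree) is an assumption, as are the function names, the factors `up` and `down`, the bounds `lo` and `hi`, the fixed bias step of 0.1, and the use of full-batch updates on the CPU. It is an illustration only and not the paper's GPU implementation.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def cd_k_gradient(W, b, c, v0, k, rng):
    """Estimate the CD-k gradient for a binary RBM on the batch v0."""
    h0_prob = sigmoid(v0 @ W + c)                      # positive phase
    h = (rng.random(h0_prob.shape) < h0_prob).astype(float)
    vk, hk_prob = v0, h0_prob
    for _ in range(k):                                 # k Gibbs steps
        vk_prob = sigmoid(h @ W.T + b)
        vk = (rng.random(vk_prob.shape) < vk_prob).astype(float)
        hk_prob = sigmoid(vk @ W + c)
        h = (rng.random(hk_prob.shape) < hk_prob).astype(float)
    n = v0.shape[0]
    dW = (v0.T @ h0_prob - vk.T @ hk_prob) / n         # data term minus model term
    db = (v0 - vk).mean(axis=0)
    dc = (h0_prob - hk_prob).mean(axis=0)
    return dW, db, dc


def adapt_steps(steps, grad, prev_grad, up=1.2, down=0.8, lo=1e-4, hi=1.0):
    """Assumed rule: grow a step on sign agreement, shrink it on disagreement."""
    prod = grad * prev_grad
    steps = np.where(prod > 0, steps * up,
                     np.where(prod < 0, steps * down, steps))
    return np.clip(steps, lo, hi)


def train_rbm(data, n_hidden, epochs=10, k=1, seed=0):
    """Full-batch CD-k training with one adaptive step size per weight."""
    rng = np.random.default_rng(seed)
    n_visible = data.shape[1]
    W = 0.01 * rng.standard_normal((n_visible, n_hidden))
    b = np.zeros(n_visible)
    c = np.zeros(n_hidden)
    steps = np.full_like(W, 0.1)       # initial per-weight step size (assumed)
    prev_dW = np.zeros_like(W)
    for _ in range(epochs):
        dW, db, dc = cd_k_gradient(W, b, c, data, k, rng)
        steps = adapt_steps(steps, dW, prev_dW)
        W += steps * dW
        b += 0.1 * db                  # biases use a fixed step for simplicity
        c += 0.1 * dc
        prev_dW = dW
    return W, b, c, steps
```

Calling `train_rbm(data, n_hidden=4, epochs=5)` on a binary array `data` of shape `(n_samples, n_visible)` returns the weights, both bias vectors, and the final per-weight step sizes. Per-weight steps let weights with consistent gradient directions move faster while oscillating ones are damped, which is one plausible way an adaptive step size could reduce the number of epochs.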

Publisher
Database: Elsevier - ScienceDirect
Journal: Pattern Recognition - Volume 47, Issue 1, January 2014, Pages 114–127