دانلود رایگان مقاله: الگوریتم یادگیری تصادفی مرتبه دوم برای آموزش شبکه عصبی بهینه شده است

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
408414	679027	2016	16 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

An optimized second order stochastic learning algorithm for neural network training

ترجمه فارسی عنوان

الگوریتم یادگیری تصادفی مرتبه دوم برای آموزش شبکه عصبی بهینه شده است

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Distributed machine learning - آموزش توزیع شده ماشین Computational efficiency - بازده محاسباتی Convolutional neural network - شبکه عصبی همجوشی Fast convergence - همگرایی سریع

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش مقاله

الگوریتم یادگیری تصادفی مرتبه دوم برای آموزش شبکه عصبی بهینه شده است

چکیده انگلیسی

This paper proposes an improved stochastic second order learning algorithm for supervised neural network training. The proposed algorithm, named bounded stochastic diagonal Levenberg–Marquardt (B-SDLM), utilizes both gradient and curvature information to achieve fast convergence while requiring only minimal computational overhead than the stochastic gradient descent (SGD) method. B-SDLM has only a single hyperparameter as opposed to most other learning algorithms that suffer from the hyperparameter overfitting problem due to having more hyperparameters to be tuned. Experiments using the multilayer perceptron (MLP) and convolutional neural network (CNN) models have shown that B-SDLM outperforms other learning algorithms with regard to the classification accuracies and computational efficiency (about 5.3% faster than SGD on the mnist-rot-bg-img database). It can classify all testing samples correctly on the face recognition case study based on AR Purdue database. In addition, experiments on handwritten digit classification case studies show that significant improvements of 19.6% on MNIST database and 17.5% on mnist-rot-bg-img database can be achieved in terms of the testing misclassification error rates (MCRs). The computationally expensive Hessian calculations are kept to a minimum by using just 0.05% of the training samples in its estimation or updating the learning rates once per two training epochs, while maintaining or even achieving lower testing MCRs. It is also shown that B-SDLM works well in the mini-batch learning mode, and we are able to achieve 3.32×3.32× performance speedup when deploying the proposed algorithm in a distributed learning environment with a quad-core processor.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Neurocomputing - Volume 186, 19 April 2016, Pages 74–89

نویسندگان

Shan Sung Liew, Mohamed Khalil-Hani, Rabia Bakhteri,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : الگوریتم یادگیری تصادفی مرتبه دوم برای آموزش شبکه عصبی بهینه شده است

دسترسی سریع

ارتباط

English Website