Effective neural network training with adaptive learning rate based on training loss

Article ID	Journal	Published Year	Pages	File Type
6863013	Neural Networks	2018	11 Pages	PDF

Abstract

A method that uses an adaptive learning rate is presented for training neural networks. Unlike most conventional updating methods in which the learning rate gradually decreases during training, the proposed method increases or decreases the learning rate adaptively so that the training loss (the sum of cross-entropy losses for all training samples) decreases as much as possible. It thus provides a wider search range for solutions and thus a lower test error rate. The experiments with some well-known datasets to train a multilayer perceptron show that the proposed method is effective for obtaining a better test accuracy under certain conditions.

Keywords

Neural network training Stochastic Gradient Descent Beam search Learning rate Multilayer perceptron Deep learning