Article ID Journal Published Year Pages File Type
6865251 Neurocomputing 2018 9 Pages PDF
Abstract
Activation functions play important roles in deep convolutional neural networks. This work focuses on learning activation functions via combining basic activation functions in a data-driven way. We explore three strategies to learn the activation functions, and allow the activation operation to be adaptive to inputs. We firstly explore two strategies to linearly and nonlinearly combine basic activation functions, respectively. Then we further investigate a strategy that basic activation functions are combined in a way of a hierarchical integration. Experiments demonstrate that the proposed activation functions lead to better performances than ReLU and its variants on benchmarks with various scales.
Related Topics
Physical Sciences and Engineering Computer Science Artificial Intelligence
Authors
, , , , ,