Article code: 4946627
Journal code: 1439410
Publication year: 2017
English article: 13 pages, PDF
Full-text version: Free download
English title of the ISI article
Accelerating deep neural network training with inconsistent stochastic gradient descent
Persian translation of the title
تسریع آموزش شبکه‌های عصبی عمیق با گرادیان کاهشی تصادفی ناسازگار
Keywords
Related subjects
Engineering and Basic Sciences > Computer Engineering > Artificial Intelligence
English abstract
Stochastic Gradient Descent (SGD) updates a Convolutional Neural Network (CNN) with a noisy gradient computed from a random batch, and each batch updates the network exactly once per epoch. This scheme applies the same training effort to every batch, overlooking the fact that the gradient variance, induced by Sampling Bias and Intrinsic Image Difference, produces different training dynamics across batches. In this paper, we develop a new training strategy for SGD, referred to as Inconsistent Stochastic Gradient Descent (ISGD), to address this problem. The core concept of ISGD is inconsistent training, which dynamically adjusts the training effort with respect to the loss. ISGD models training as a stochastic process that gradually reduces the mean of the batch loss, and it uses a dynamic upper control limit to identify large-loss batches on the fly. ISGD stays on an identified batch to accelerate training with additional gradient updates, while a constraint penalizes drastic parameter changes. ISGD is straightforward, computationally efficient, and requires no auxiliary memory. A series of empirical evaluations on real-world datasets and networks demonstrates the promising performance of inconsistent training.
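To make the inconsistent-training idea described above concrete, the following Python sketch (using PyTorch) illustrates one plausible reading of it: track a running mean and standard deviation of batch losses, flag any batch whose loss exceeds a dynamic upper control limit, and stay on that batch for additional updates while penalizing drift from the parameters it started with. The toy model and data, the control-limit width c, the extra-update cap, and the penalty weight are illustrative assumptions, not the paper's exact algorithm.

import torch
import torch.nn as nn

torch.manual_seed(0)

# Toy model and synthetic data (stand-ins for a real CNN and dataset).
model = nn.Sequential(nn.Linear(20, 64), nn.ReLU(), nn.Linear(64, 2))
criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

data = torch.randn(512, 20)
labels = torch.randint(0, 2, (512,))
batches = [(data[i:i + 32], labels[i:i + 32]) for i in range(0, 512, 32)]

c = 3.0          # width of the upper control limit (assumed value)
max_extra = 5    # cap on extra updates for a large-loss batch (assumed value)
penalty = 1e-3   # weight of the constraint on drastic parameter changes (assumed value)

loss_mean, loss_var, n_seen = 0.0, 0.0, 0

for epoch in range(3):
    for x, y in batches:
        loss = criterion(model(x), y)
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()

        # Running mean/variance of batch losses (Welford update), used to
        # form a dynamic upper control limit: mean + c * std.
        n_seen += 1
        delta = loss.item() - loss_mean
        loss_mean += delta / n_seen
        loss_var += delta * (loss.item() - loss_mean)
        std = (loss_var / max(n_seen - 1, 1)) ** 0.5
        limit = loss_mean + c * std

        # Inconsistent training: stay on a large-loss batch with additional
        # gradient updates, penalizing drift from its starting parameters.
        if loss.item() > limit:
            ref = [p.detach().clone() for p in model.parameters()]
            for _ in range(max_extra):
                extra_loss = criterion(model(x), y)
                drift = sum(((p - r) ** 2).sum()
                            for p, r in zip(model.parameters(), ref))
                total = extra_loss + penalty * drift
                optimizer.zero_grad()
                total.backward()
                optimizer.step()
                if extra_loss.item() <= limit:
                    break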
Publisher
Database: Elsevier - ScienceDirect
Journal: Neural Networks - Volume 93, September 2017, Pages 219-229
Authors