کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
410929 679170 2006 15 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
PIRANHA: Policy iteration for recurrent artificial neural networks with hidden activities
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
پیش نمایش صفحه اول مقاله
PIRANHA: Policy iteration for recurrent artificial neural networks with hidden activities
چکیده انگلیسی

It is an intriguing task to develop efficient connectionist representations for learning long time series. Recurrent neural networks have great promises here. We model the learning task as a minimization problem of a nonlinear least-squares cost function, that takes into account both one-step and multi-step prediction errors. The special structure of the cost function is constructed to build a bridge to reinforcement learning. We exploit this connection and derive a convergent, policy iteration-based algorithm, and show that RNN training can be made to fit the reinforcement learning framework in a natural fashion. The relevance of this connection is discussed. We also present experimental results, which demonstrate the appealing properties of the unique parameter structure prescribed by reinforcement learning. Experiments cover both sequence learning and long-term prediction.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Neurocomputing - Volume 70, Issues 1–3, December 2006, Pages 577–591
نویسندگان
, ,