PIRANHA: Policy iteration for recurrent artificial neural networks with hidden activities

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
410929	679170	2006	15 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Policy iteration - تکرار سیاست Recurrent neural networks - شبکه های عصبی راجعه Multi-step prediction - پیش بینی چند مرحله ای Sequence learning - یادگیری توالی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش صفحه اول مقاله

PIRANHA: Policy iteration for recurrent artificial neural networks with hidden activities

چکیده انگلیسی

It is an intriguing task to develop efficient connectionist representations for learning long time series. Recurrent neural networks have great promises here. We model the learning task as a minimization problem of a nonlinear least-squares cost function, that takes into account both one-step and multi-step prediction errors. The special structure of the cost function is constructed to build a bridge to reinforcement learning. We exploit this connection and derive a convergent, policy iteration-based algorithm, and show that RNN training can be made to fit the reinforcement learning framework in a natural fashion. The relevance of this connection is discussed. We also present experimental results, which demonstrate the appealing properties of the unique parameter structure prescribed by reinforcement learning. Experiments cover both sequence learning and long-term prediction.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Neurocomputing - Volume 70, Issues 1–3, December 2006, Pages 577–591

نویسندگان

István Szita, András Lőrincz,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

PIRANHA: Policy iteration for recurrent artificial neural networks with hidden activities

دسترسی سریع

ارتباط

English Website