کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
7109209 1460628 2018 7 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Stochastic optimal control via forward and backward stochastic differential equations and importance sampling
ترجمه فارسی عنوان
کنترل بهینه تصادفی با استفاده از معادلات دیفرانسیل پیشین و عقب ماندگار و انتخاب اهمیت
کلمات کلیدی
کنترل تصادفی، معادلات دیفرانسیل پیشین و عقب مانده، نمونه گیری اهمیت،
موضوعات مرتبط
مهندسی و علوم پایه سایر رشته های مهندسی کنترل و سیستم های مهندسی
چکیده انگلیسی
The aim of this work is to present a novel sampling-based numerical scheme designed to solve a certain class of stochastic optimal control problems, utilizing forward and backward stochastic differential equations (FBSDEs). By means of a nonlinear version of the Feynman-Kac lemma, we obtain a probabilistic representation of the solution to the nonlinear Hamilton-Jacobi-Bellman equation, expressed in the form of a system of decoupled FBSDEs. This system of FBSDEs can be solved by employing linear regression techniques. The proposed framework relaxes some of the restrictive conditions present in recent sampling based methods within the Linearly Solvable Optimal Control framework, and furthermore addresses problems in which the time horizon is not prespecified. To enhance the efficiency of the proposed scheme when treating more complex nonlinear systems, we then derive an iterative algorithm based on Girsanov's theorem on the change of measure, which features importance sampling. This scheme is shown to be capable of learning the optimal control without requiring an initial guess.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Automatica - Volume 87, January 2018, Pages 159-165
نویسندگان
, ,