دانلود رایگان مقاله: پیدا کردن بسیاری از راه های احتمالی هدفمند با استفاده از یادگیری تقویتی

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
4943441	1437634	2017	40 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Many-objective stochastic path finding using reinforcement learning

ترجمه فارسی عنوان

پیدا کردن بسیاری از راه های احتمالی هدفمند با استفاده از یادگیری تقویتی

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

بسیاری از اهداف تقویت یادگیری، پیدا کردن مسیر تصادفی، تصمیم گیری مستمر در عدم قطعیت، نظریه انتخاب اجتماعی

Social choice theory - نظریه انتخاب اجتماعی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش مقاله

پیدا کردن بسیاری از راه های احتمالی هدفمند با استفاده از یادگیری تقویتی

چکیده انگلیسی

In this paper, we investigate solutions to path finding problems with many conflicting objectives, and introduce a new model-free many objective reinforcement learning algorithm, called Voting Q-learning, that is capable of finding a set of optimal policies in an initially unknown, stochastic environment with several conflicting objectives. Current methods for solving this type of problem rely on Pareto dominance to determine which actions are optimal, which decreases in effectiveness as the number of objectives increases, ultimately selecting actions at random in environments where all potential actions are Pareto optimal. Alternative methods for addressing this problem require interaction with a decision maker or a priori knowledge of the problem structure for guidance towards optimal solutions, making them insufficient for fully autonomous use or problems where preferred solutions are initially unknown. As an alternative, we propose the use of voting methods from social choice theory to determine a set of Pareto optimal policies by aggregating preferences determined by the evaluation of environment conditions for each objective. We demonstrate the effectiveness of this method with multiple deterministic and stochastic many-objective path finding problems that are solved optimally without any advance knowledge of the problem or interaction with a decision maker, showing that our approach is the first to provide optimal performance for an autonomous, intelligent system operating in a many objective environment.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Expert Systems with Applications - Volume 72, 15 April 2017, Pages 371-382

نویسندگان

Bentz Tozer, Thomas Mazzuchi, Shahram Sarkani,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : پیدا کردن بسیاری از راه های احتمالی هدفمند با استفاده از یادگیری تقویتی

دسترسی سریع

ارتباط

English Website