کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
384916 660856 2015 15 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Correcting flawed expert knowledge through reinforcement learning
ترجمه فارسی عنوان
اصلاح دانش تخصصی ناقص از طریق تقویت یادگیری
کلمات کلیدی
کسب دانش، استدلال مبتنی بر منطق، تقویت یادگیری، تجدید نظری
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی


• Reinforcement learning used to correct erroneous knowledge in a tactical agent.
• Such experiential learning also creates missing knowledge for a tactical agent.
• Prototype was built and extensively tested to verify usefulness of method.

Subject matter experts can sometimes provide incorrect and/or incomplete knowledge in the process of building intelligent systems. Other times, the expert articulates correct knowledge only to be misinterpreted by the knowledge engineer. In yet other cases, changes in the domain can lead to outdated knowledge in the system. This paper describes a technique that improves a flawed tactical agent by revising its knowledge through practice in a simulated version of its operational environment. This form of theory revision repairs agents originally built through interaction with subject matter experts. It is advantageous because such systems can now cease to be completely dependent on human expertise to provide correct and complete domain knowledge. After an agent has been built in consultation with experts, and before it is allowed to become operational, our method permits its improvement by subjecting it to several practice sessions in a simulation of its mission environment. Our method uses reinforcement learning to correct such errors and fill in gaps in the knowledge of a context-based tactical agent. The method was implemented and evaluated by comparing the performance of an agent improved by our method, to the original hand-built agent whose knowledge was purposely seeded with known errors and/or gaps. The results show that the improved agent did in fact correct the seeded errors and did gain the missing knowledge to permit it to perform better than the original, flawed agent.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Expert Systems with Applications - Volume 42, Issues 17–18, October 2015, Pages 6457–6471
نویسندگان
, ,