Article ID Journal Published Year Pages File Type
383741 Expert Systems with Applications 2014 15 Pages PDF
Abstract

•We present a unified framework for learning from observation (LfO).•We present a Dynamic Bayesian Network (DBN) model of LfO.•We present a novel set of evaluation metrics for LfO algorithms.•We show evidence that our metrics better capture LfO algorithm performance than metrics used in previous LfO work.

Learning from observation (LfO), also known as learning from demonstration, studies how computers can learn to perform complex tasks by observing and thereafter imitating the performance of a human actor. Although there has been a significant amount of research in this area, there is no agreement on a unified terminology or evaluation procedure. In this paper, we present a theoretical framework based on Dynamic-Bayesian Networks (DBNs) for the quantitative modeling and evaluation of LfO tasks. Additionally, we provide evidence showing that: (1) the information captured through the observation of agent behaviors occurs as the realization of a stochastic process (and often not just as a sample of a state-to-action map); (2) learning can be simplified by introducing dynamic Bayesian models with hidden states for which the learning and model evaluation tasks can be reduced to minimization and estimation of some stochastic similarity measures such as crossed entropy.

Related Topics
Physical Sciences and Engineering Computer Science Artificial Intelligence
Authors
, , ,