A Dynamic-Bayesian Network framework for modeling and evaluating learning from observation

Article ID	Journal	Published Year	Pages	File Type
383741	Expert Systems with Applications	2014	15 Pages	PDF

Abstract

• We present a unified framework for learning from observation (LfO).
• We present a Dynamic Bayesian Network (DBN) model of LfO.
• We present a novel set of evaluation metrics for LfO algorithms.
• We show evidence that our metrics better capture LfO algorithm performance than metrics used in previous LfO work.

Learning from observation (LfO), also known as learning from demonstration, studies how computers can learn to perform complex tasks by observing and then imitating the performance of a human actor. Although there has been a significant amount of research in this area, there is no agreement on a unified terminology or evaluation procedure. In this paper, we present a theoretical framework based on Dynamic Bayesian Networks (DBNs) for the quantitative modeling and evaluation of LfO tasks. Additionally, we provide evidence showing that: (1) the information captured through the observation of agent behaviors occurs as the realization of a stochastic process (and often not just as a sample of a state-to-action map); and (2) learning can be simplified by introducing dynamic Bayesian models with hidden states, for which the learning and model evaluation tasks can be reduced to the minimization and estimation of stochastic similarity measures such as cross entropy.
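To make the idea of a cross-entropy similarity measure concrete, the following is a minimal sketch and not the metric defined in the paper. It assumes the simplest possible setting: a learned agent represented as a state-to-action probability table with no hidden state, and an expert trace given as a list of (state, action) pairs. The function name `cross_entropy`, the states, the actions, and all probabilities are hypothetical and chosen only for illustration.

```python
import math


def cross_entropy(trace, policy, eps=1e-12):
    """Empirical cross entropy (in nats) of an observed trace under a policy.

    trace:  list of (state, action) pairs observed from the expert
            (hypothetical data for this sketch).
    policy: dict mapping state -> dict mapping action -> probability.
    eps:    floor on probabilities so that unseen actions do not yield log(0).
    """
    total = 0.0
    for state, action in trace:
        p = policy.get(state, {}).get(action, 0.0)
        total -= math.log(max(p, eps))
    # Average negative log-likelihood over the trace.
    return total / len(trace)


# Hypothetical expert behavior: usually advances when far, mixed when near.
expert_trace = [
    ("far", "advance"),
    ("far", "advance"),
    ("near", "stop"),
    ("near", "advance"),
]

# Two hypothetical learned agents to compare against the expert trace.
good_policy = {
    "far": {"advance": 0.9, "stop": 0.1},
    "near": {"advance": 0.4, "stop": 0.6},
}
poor_policy = {
    "far": {"advance": 0.2, "stop": 0.8},
    "near": {"advance": 0.5, "stop": 0.5},
}

print("good agent:", cross_entropy(expert_trace, good_policy))
print("poor agent:", cross_entropy(expert_trace, poor_policy))
```

A lower value means the agent assigns higher probability to what the expert actually did. Because this sketch conditions only on the current state, it illustrates exactly the state-to-action view that the abstract describes as often insufficient; a model with hidden states would replace the table lookup with the likelihood of the whole trace under the DBN.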

Keywords

dynamic Bayesian networks