Article ID Journal Published Year Pages File Type
4634919 Applied Mathematics and Computation 2008 11 Pages PDF
Abstract

We consider a stochastic shortest path problem with associative criteria in which for each node of a graph we choose a probability distribution over the set of successor nodes so as to reach a given target node optimally. We formulate such a problem as an associative Markov decision processes. We show that an optimal value function is a unique solution to an optimality equation and find an optimal stationary policy. Also we give a value iteration method and a policy improvement method.

Related Topics
Physical Sciences and Engineering Mathematics Applied Mathematics
Authors
,