| Article ID | Journal | Published Year | Pages | File Type |
|---|---|---|---|---|
| 4634919 | Applied Mathematics and Computation | 2008 | 11 Pages |
Abstract
We consider a stochastic shortest path problem with associative criteria in which for each node of a graph we choose a probability distribution over the set of successor nodes so as to reach a given target node optimally. We formulate such a problem as an associative Markov decision processes. We show that an optimal value function is a unique solution to an optimality equation and find an optimal stationary policy. Also we give a value iteration method and a policy improvement method.
Related Topics
Physical Sciences and Engineering
Mathematics
Applied Mathematics
Authors
Yoshio Ohtsubo,
