Article ID Journal Published Year Pages File Type
1142997 Operations Research Letters 2007 8 Pages PDF
Abstract

In ergodic MDPs we consider stationary distributions of policies that coincide in all but n   states, in which one of two possible actions is chosen. We give conditions and formulas for linear dependence of the stationary distributions of n+2n+2 such policies, and show some results about combinations and mixtures of policies.

Related Topics
Physical Sciences and Engineering Mathematics Discrete Mathematics and Combinatorics
Authors
,