Article ID Journal Published Year Pages File Type
5099203 Journal of Economic Dynamics and Control 2009 10 Pages PDF
Abstract
In many dynamic programming problems, a mix of state variables exists - some exhibiting stochastic cycles and others having deterministic cycles. We derive a formula for the value function in infinite-horizon, stationary, Markovian decision problems by exploiting a special partitioned-circulant structure of the transition matrix Π. Our strategy for computing the left-inverse of the matrix [I-βΠ], which is central to implementing Howard's policy iteration algorithm, yields significant improvements in computation time and major reductions in memory required. When the deterministic cycle is of order n, our cyclic inversion algorithm yields an O(n2) speed-up relative to the usual policy iteration algorithm.
Related Topics
Physical Sciences and Engineering Mathematics Control and Optimization
Authors
, ,