Valuing programs with deterministic and stochastic cycles

Article ID	Journal	Published Year	Pages	File Type
5099203	Journal of Economic Dynamics and Control	2009	10 Pages	PDF

Abstract

In many dynamic programming problems, a mix of state variables exists - some exhibiting stochastic cycles and others having deterministic cycles. We derive a formula for the value function in infinite-horizon, stationary, Markovian decision problems by exploiting a special partitioned-circulant structure of the transition matrix Π. Our strategy for computing the left-inverse of the matrix [I-βΠ], which is central to implementing Howard's policy iteration algorithm, yields significant improvements in computation time and major reductions in memory required. When the deterministic cycle is of order n, our cyclic inversion algorithm yields an O(n²) speed-up relative to the usual policy iteration algorithm.
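
The source of a speed-up of this order can be illustrated with a minimal sketch. It is not the paper's formula, only one plausible reading of the block-cyclic structure, under these assumptions: the state is a pair (phase k, stochastic state s), the phase advances deterministically k -> k+1 mod n, and the stochastic state moves according to an m x m matrix P_k that depends on the current phase. The function names and the layout of `P` and `u` are hypothetical. Substituting V_k = u_k + β P_k V_{k+1} around the cycle leaves a single m x m system for V_0, roughly O(n·m³) work instead of O(n³·m³) for a dense solve of the full nm x nm system.

```python
import numpy as np

def cyclic_policy_value(P, u, beta):
    """Policy evaluation exploiting an assumed block-cyclic transition structure.

    P    : list of n row-stochastic (m x m) arrays, P[k] used in phase k
    u    : list of n length-m reward vectors under the fixed policy
    beta : discount factor in (0, 1)
    Returns an (n x m) array whose row k is V_k.
    """
    n = len(P)
    m = P[0].shape[0]
    # Accumulate c = sum_j beta^j P_0...P_{j-1} u_j and M = beta^n P_0...P_{n-1}.
    M = np.eye(m)
    c = np.zeros(m)
    for k in range(n):
        c += M @ u[k]
        M = beta * (M @ P[k])
    V = [None] * n
    # One small solve: (I - M) V_0 = c.
    V[0] = np.linalg.solve(np.eye(m) - M, c)
    # Back-substitute V_k = u_k + beta P_k V_{k+1}, with V_n identified with V_0.
    nxt = V[0]
    for k in range(n - 1, 0, -1):
        V[k] = u[k] + beta * (P[k] @ nxt)
        nxt = V[k]
    return np.vstack(V)

def dense_policy_value(P, u, beta):
    """Reference: build the full (nm x nm) matrix and solve (I - beta*Pi) V = u."""
    n = len(P)
    m = P[0].shape[0]
    Pi = np.zeros((n * m, n * m))
    for k in range(n):
        j = (k + 1) % n  # phase k always moves to phase k+1 (mod n)
        Pi[k * m:(k + 1) * m, j * m:(j + 1) * m] = P[k]
    V = np.linalg.solve(np.eye(n * m) - beta * Pi, np.concatenate(u))
    return V.reshape(n, m)
```

Both functions should agree up to floating-point error on any valid input; the dense version is included only as a check and does not scale to large n.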

Keywords

C13; C14; C15; Deterministic cycles; Dynamic programming; Policy iteration; Circulant matrix