Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
1142629 | Operations Research Letters | 2011 | 6 Pages |
Abstract
This paper deals with discrete-time Markov decision processes with state-dependent discount factors and unbounded rewards/costs. Under general conditions, we develop an iteration algorithm for computing the optimal value function, and also prove the existence of optimal stationary policies. Furthermore, we illustrate our results with a cash-balance model.
► This paper studies Markov decision processes with state-dependent discount factors. ► Suitable conditions are given for the existence of optimal stationary policies. ► Then, an iteration algorithm for computing the optimal value function is developed. ► Finally, a cash balance model is used to illustrate the main results in this paper.
Related Topics
Physical Sciences and Engineering
Mathematics
Discrete Mathematics and Combinatorics
Authors
Qingda Wei, Xianping Guo,