A fuzzy approach to Markov decision processes with uncertain transition probabilities

Article ID	Journal	Published Year	Pages	File Type
391402	Fuzzy Sets and Systems	2006	9 Pages	PDF

Abstract

In this paper, Markov decision models with uncertain transition matrices, which allow a matrix to fluctuate at each step in time, is described by the use of fuzzy sets. We find a Pareto optimal policy maximizing the infinite horizon fuzzy expected discounted reward (FEDR) over all stationary policies under some partial order. The Pareto optimal policies are characterized by maximal solutions of an optimal inclusion including efficient set-functions. As a numerical example, a machine maintenance problem is considered.