Article ID: 391402
Journal: Fuzzy Sets and Systems
Published Year: 2006
Pages: 9
File Type: PDF
Abstract

In this paper, Markov decision models with uncertain transition matrices, in which the matrix is allowed to fluctuate at each step in time, are described using fuzzy sets. We find a Pareto optimal policy that maximizes the infinite horizon fuzzy expected discounted reward (FEDR) over all stationary policies with respect to a partial order. The Pareto optimal policies are characterized by maximal solutions of an optimal inclusion including efficient set-functions. As a numerical example, a machine maintenance problem is considered.
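
To make the idea of a transition matrix that fluctuates at each step more concrete, the following is a minimal sketch, not the paper's method. It assumes the uncertainty is reduced to a single interval per transition probability (roughly, one level set of a fuzzy set) and computes lower and upper bounds on the expected discounted reward of one fixed stationary policy by iterating a worst-case and a best-case backup. The function names, the two-state machine (working, failed), and all numbers are hypothetical and chosen only for illustration.

```python
def extreme_row(lo, hi, v, maximize):
    """Return a probability vector p with lo <= p <= hi and sum(p) == 1
    that minimizes (or maximizes) sum(p[j] * v[j]).
    Assumes sum(lo) <= 1 <= sum(hi)."""
    p = list(lo)
    slack = 1.0 - sum(lo)
    # Give the remaining mass first to the states with the smallest
    # (or largest, when maximizing) values.
    order = sorted(range(len(v)), key=lambda j: v[j], reverse=maximize)
    for j in order:
        add = max(0.0, min(hi[j] - lo[j], slack))
        p[j] += add
        slack -= add
    return p


def discounted_reward_bounds(r, P_lo, P_hi, beta, iters=500):
    """Bounds on the discounted reward of a fixed policy when each row of
    the transition matrix may change at every step within [P_lo, P_hi]."""
    n = len(r)
    lower = [0.0] * n
    upper = [0.0] * n
    for _ in range(iters):
        new_lower, new_upper = [], []
        for i in range(n):
            p_min = extreme_row(P_lo[i], P_hi[i], lower, maximize=False)
            p_max = extreme_row(P_lo[i], P_hi[i], upper, maximize=True)
            new_lower.append(r[i] + beta * sum(p * x for p, x in zip(p_min, lower)))
            new_upper.append(r[i] + beta * sum(p * x for p, x in zip(p_max, upper)))
        lower, upper = new_lower, new_upper
    return lower, upper


# Hypothetical two-state machine: state 0 = working, state 1 = failed.
reward = [1.0, 0.0]                      # reward per step in each state
P_lo = [[0.8, 0.1], [0.5, 0.3]]          # lower bounds on each transition
P_hi = [[0.9, 0.2], [0.7, 0.5]]          # upper bounds on each transition
low, high = discounted_reward_bounds(reward, P_lo, P_hi, beta=0.9)
print("lower bounds:", low)
print("upper bounds:", high)
```

When the lower and upper matrices coincide, both bounds reduce to the ordinary discounted reward of the policy. A fuzzy treatment such as the one in the paper would work with a whole family of such level sets and a partial order on the resulting fuzzy rewards, which this sketch does not attempt.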

Related Topics
Physical Sciences and Engineering > Computer Science > Artificial Intelligence