Article ID Journal Published Year Pages File Type
478427 European Journal of Operational Research 2012 9 Pages PDF
Abstract

This paper provides a unified framework to study monotone optimal control for a class of Markov decision processes through D-multimodularity. We demonstrate that each system in this class can be classified as either a substitution-type or a complement-type system according to the possible transition set, which can be used as a classification mechanism that integrates a variety of models in the literature. We develop a generic proof of the structural properties of both types of system. In particular, we show that D-multimodularity is a generally sufficient condition for monotone optimal control of different types of system in this class. With this unified theory, there is no need to pursue each problem ad hoc and the structural properties of this class of MDPs follow with ease.

► A unified framework to study monotone optimal control for a class of Markov decision processes through D-multimodularity. ► System classification mechanism and generic proof of structural properties. ► With this unified theory, no need to pursue each problem ad hoc and structural properties of this class follow with ease.

Related Topics
Physical Sciences and Engineering Computer Science Computer Science (General)
Authors
, ,