Maximizing the probability of attaining a target prior to extinction

Article ID	Journal	Published Year	Pages	File Type
1713765	Nonlinear Analysis: Hybrid Systems	2011	15 Pages	PDF

Abstract

We present a dynamic programming-based solution to the problem of maximizing the probability of attaining a target set before hitting a cemetery set for a discrete-time Markov control process. Under mild hypotheses we establish that there exists a deterministic stationary policy that achieves the maximum value of this probability. We demonstrate how the maximization of this probability can be computed through the maximization of an expected total reward until the first hitting time to either the target or the cemetery set. Martingale characterizations of thrifty, equalizing, and optimal policies in the context of our problem are also established.

Keywords

Dynamic programming Probability maximization