Reinforcement learning in population games

Article ID	Journal	Published Year	Pages	File Type
5071801	Games and Economic Behavior	2013	29 Pages	PDF

Abstract

We study reinforcement learning in a population game. Agents in a population game revise mixed strategies using the Cross rule of reinforcement learning. The population state-the probability distribution over the set of mixed strategies-evolves according to the replicator continuity equation which, in its simplest form, is a partial differential equation. The replicator dynamic is a special case in which the initial population state is homogeneous, i.e. when all agents use the same mixed strategy. We apply the continuity dynamic to various classes of symmetric games. Using 3Ã3 coordination games, we show that equilibrium selection depends on the variance of the initial strategy distribution, or initial population heterogeneity. We give an example of a 2Ã2 game in which heterogeneity persists even as the mean population state converges to a mixed equilibrium. Finally, we apply the dynamic to negative definite and doubly symmetric games.

âº Agents in a population game revise mixed strategies using reinforcement learning. âº The strategy distribution evolves according to the replicator continuity equation. âº The replicator dynamic is a special case when all agents use same mixed strategy. âº Equilibrium selection depends on the variance of the initial strategy distribution. âº Even if average strategy approaches Nash equilibrium, variance may persist.

Keywords

C72 C73 Replicator dynamics Continuity equation Reinforcement learning