Average-Reward Decentralized Markov Decision Processes

Marek Petrik and Shlomo Zilberstein. Average-Reward Decentralized Markov Decision Processes. Proceedings of the Twentieth International Joint Conference on Artificial Intelligence (IJCAI), 1997-2002, Hyderabad, India, 2007.

Abstract

Formal analysis of decentralized decision making has become a thriving research area in recent years, producing a number of multi-agent extensions of Markov decision processes. While much of the work has focused on optimizing discounted cumulative reward, optimizing average reward is sometimes a more suitable criterion. We formalize a class of such problems and analyze its characteristics, showing that it is NP complete and that optimal policies are deterministic. Our analysis lays the foundation for designing two optimal algorithms. Experimental results with a standard problem from the literature illustrate the applicability of these solution techniques.

Bibtex entry:

@inproceedings{PZijcai07,
  author	= {Marek Petrik and Shlomo Zilberstein},
  title		= {Average-Reward Decentralized {M}arkov Decision Processes},
  booktitle     = {Proceedings of the Twentieth International Joint Conference on
		   Artificial Intelligence},
  year		= {2007},
  pages		= {1997-2002},
  address       = {Hyderabad, India},
  url		= {http://rbr.cs.umass.edu/shlomo/papers/PZijcai07.html}
}

shlomo@cs.umass.edu
UMass Amherst