Bounded Dynamic Programming for Decentralized POMDPs

Christopher Amato, Alan Carlin, and Shlomo Zilberstein. Bounded Dynamic Programming for Decentralized POMDPs. Proceedings of the AAMAS Workshop on Multi-Agent Sequential Decision Making in Uncertain Domains (MSDM), Honolulu, Hawaii, May, 2007.

Abstract

Solving decentralized POMDPs (DEC-POMDPs) optimally is a very hard problem. As a result, several approximate algorithms have been developed, but these do not have satisfactory error bounds. In this paper, we first discuss optimal dynamic programming and some approximate finite horizon DEC-POMDP algorithms. We then present a bounded dynamic programming algorithm. Given a problem and an error bound, the algorithm will return a solution within that bound when it is able to solve the problem. We give a proof of this bound and provide some experimental results showing high quality solutions to large DEC-POMDPs for large horizons.

Bibtex entry:

@inproceedings{ACZmsdm07,
  author	= {Christopher Amato and Alan Carlin and Shlomo Zilberstein},
  title		= {Bounded Dynamic Programming for Decentralized {POMDP}s},
  booktitle     = {Proceedings of the {AAMAS} Workshop on Multi-Agent 
Sequential Decision Making in Uncertain Domains},
  year		= {2007},
  pages		= {},
  address       = {Honolulu, Hawaii},
  url		= {http://rbr.cs.umass.edu/shlomo/papers/ACZmsdm07.html}
}

shlomo@cs.umass.edu
UMass Amherst