DEC-POMDP overview

The decentralized partially observable Markov decision process (DEC-POMDP) framework is a model to represent multiple agents making decisions under uncertainty. It is an extension of the partially observable Markov decision process (POMDP) framework and a specific case of a partially observable stochastic game (POSG) (see Hansen, et al., 2004).

DEC-POMDPs and (PO)MDPs


For two agents, a DEC-POMDP can be described as follows:

M = &lang S, A1, A2, P, R, &Omega1, &Omega2, O, T &rang

A DEC-MDP is defined in the same way except:


"... there is currently no good way to combine game theoretic and POMDP control strategies." -- Russell and Norvig, 2003

DEC-POMDP algorithms

Optimal and approximate algorithms have been developed for general DEC-POMDPs and many subclasses. The worst case complexity of the general case has been shown to be NEXP-complete (Bernstein et al., 2002) with optimal solutions found by dynamic programming (Hansen et al., 2004) and approximate solutions may be found using bounded policy iteration (Bernstein et al., 2005). A reduction in complexity is seen when the agents are mostly independent (Becker et al., 2003) and communication can be explicitly modeled and analyzed (Goldman et al., 2004).


Home