Influence Diagrams with Memory States: Representation and Algorithms

Xiaojian Wu, Akshat Kumar, and Shlomo Zilberstein. Influence Diagrams with Memory States: Representation and Algorithms. Proceedings of the Second International Conference on Algorithmic Decision Theory (ADT), Rutgers University, 2011.

Abstract

Influence diagrams (IDs) offer a powerful framework for decision making under uncertainty, but their applicability has been hindered by the exponential growth of runtime and memory usage--largely due to the no-forgetting assumption. We present a novel way to maintain a limited amount of memory to inform each decision and still obtain near-optimal policies. The approach is based on augmenting the graphical model with memory states that represent key aspects of previous observations--a method that has proved useful in POMDP solvers. We also derive an efficient EM-based message-passing algorithm to compute the policy. Experimental results show that this approach produces high-quality approximate polices and offers better scalability than existing methods.

Bibtex entry:

@inproceedings{WKZadt11,
  author	= {Xiaojian Wu and Akshat Kumar and Shlomo Zilberstein},
  title		= {Influence Diagrams with Memory States: Representation and Algorithms},
  booktitle     = {Proceedings of the Second International Conference on Algorithmic 
                   Decision Theory},
  year		= {2011},
  pages		= {},
  address       = {Rutgers University, USA},
  url		= {http://rbr.cs.umass.edu/shlomo/papers/WKZadt11.html}
}

shlomo@cs.umass.edu
UMass Amherst