Sciweavers

SARA
2007
Springer

Active Learning of Dynamic Bayesian Networks in Markov Decision Processes

14 years 5 months ago
Active Learning of Dynamic Bayesian Networks in Markov Decision Processes
Several recent techniques for solving Markov decision processes use dynamic Bayesian networks to compactly represent tasks. The dynamic Bayesian network representation may not be given, in which case it is necessary to learn it if one wants to apply these techniques. We develop an algorithm for learning dynamic Bayesian network representations of Markov decision processes using data collected through exploration in the environment. To accelerate data collection we develop a novel scheme for active learning of the networks. We assume that it is not possible to sample the process in arbitrary states, only along trajectories, which prevents us from applying existing active learning techniques. Our active learning scheme selects actions that maximize the total entropy of distributions used to evaluate potential refinements of the networks.
Anders Jonsson, Andrew G. Barto
Added 09 Jun 2010
Updated 09 Jun 2010
Type Conference
Year 2007
Where SARA
Authors Anders Jonsson, Andrew G. Barto
Comments (0)