Learning first-order Markov models for control

14 years 2 months ago

Download books.nips.cc

First-order Markov models have been successfully applied to many problems, for example in modeling sequential data using Markov chains, and modeling control problems using the Markov decision processes (MDP) formalism. If a first-order Markov model's parameters are estimated from data, the standard maximum likelihood estimator considers only the first-order (single-step) transitions. But for many problems, the firstorder conditional independence assumptions are not satisfied, and as a result the higher order transition probabilities may be poorly approximated. Motivated by the problem of learning an MDP's parameters for control, we propose an algorithm for learning a first-order Markov model that explicitly takes into account higher order interactions during training. Our algorithm uses an optimization criterion different from maximum likelihood, and allows us to learn models that capture longer range effects, but without giving up the benefits of using first-order Markov mo...

Pieter Abbeel, Andrew Y. Ng

Real-time Traffic

First-order Markov Models | Markov Model Parameters | Maximum Likelihood | NIPS 2004 | NIPS 2007 |

claim paper

Post Info
More Details (n/a)

Added	31 Oct 2010
Updated	31 Oct 2010
Type	Conference
Year	2004
Where	NIPS
Authors	Pieter Abbeel, Andrew Y. Ng

Comments (0)

Sciweavers

Learning first-order Markov models for control

First-order Markov Models | Markov Model Parameters | Maximum Likelihood | NIPS 2004 | NIPS 2007 |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers