Search Sciweavers | Sciweavers

238 search results - page 9 / 48

» Value-Function Approximations for Partially Observable Marko...

107

click to vote

COLT
2000
Springer

87views Machine Learning» more COLT 2000»

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

15 years 7 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process ( ¢¡¤£¦¥§ ), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

claim paper

Read More »

157

click to vote

NIPS
2007

207views Information Technology» more NIPS 2007»

Bayes-Adaptive POMDPs

15 years 4 months ago

Download books.nips.cc

Bayesian Reinforcement Learning has generated substantial interest recently, as it provides an elegant solution to the exploration-exploitation trade-off in reinforcement learning...

Stéphane Ross, Brahim Chaib-draa, Joelle Pi...

claim paper

Read More »

154

Voted

AMAI
2004
Springer

164views Artificial Intelligence» more AMAI 2004»

A Framework for Sequential Planning in Multi-Agent Settings

15 years 8 months ago

Download www.jair.org

This paper extends the framework of partially observable Markov decision processes (POMDPs) to multi-agent settings by incorporating the notion of agent models into the state spac...

Piotr J. Gmytrasiewicz, Prashant Doshi

claim paper

Read More »

140

Voted

ICML
1999
IEEE

163views Machine Learning» more ICML 1999»

Monte Carlo Hidden Markov Models: Learning Non-Parametric Models of Partially Observable Stochastic Processes

16 years 4 months ago

Download www.ri.cmu.edu

We present a learning algorithm for non-parametric hidden Markov models with continuous state and observation spaces. All necessary probability densities are approximated using sa...

Sebastian Thrun, John Langford, Dieter Fox

claim paper

Read More »

132

Voted

NIPS
2004

125views Information Technology» more NIPS 2004»

VDCBPI: an Approximate Scalable Algorithm for Large POMDPs

15 years 4 months ago

Download books.nips.cc

Existing algorithms for discrete partially observable Markov decision processes can at best solve problems of a few thousand states due to two important sources of intractability:...

Pascal Poupart, Craig Boutilier

claim paper

Read More »

« Prev « First page 9 / 48 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers