Search Sciweavers | Sciweavers

509 search results - page 17 / 102

» Compositional Models for Reinforcement Learning

239

Voted

NIPS
2007

207views Information Technology» more NIPS 2007»

Bayes-Adaptive POMDPs

15 years 8 months ago

Download books.nips.cc

Bayesian Reinforcement Learning has generated substantial interest recently, as it provides an elegant solution to the exploration-exploitation trade-off in reinforcement learning...

Stéphane Ross, Brahim Chaib-draa, Joelle Pi...

claim paper

Read More »

166

click to vote

CIIA
2009

208views Information Technology» more CIIA 2009»

Dynamic Scheduling in Petroleum Process using Reinforcement Learning

15 years 7 months ago

Download sunsite.informatik.rwth-aachen.de

Petroleum industry production systems are highly automatized. In this industry, all functions (e.g., planning, scheduling and maintenance) are automated and in order to remain comp...

Nassima Aissani, Bouziane Beldjilali

claim paper

Read More »

178

Voted

ICML
1999
IEEE

129views Machine Learning» more ICML 1999»

Implicit Imitation in Multiagent Reinforcement Learning

16 years 7 months ago

Download www.cs.toronto.edu

Imitation is actively being studied as an effective means of learning in multi-agent environments. It allows an agent to learn how to act well (perhaps optimally) by passively obs...

Bob Price, Craig Boutilier

claim paper

Read More »

177

click to vote

IJCAI
2003

188views Artificial Intelligence» more IJCAI 2003»

A Bayesian Approach to Imitation in Reinforcement Learning

15 years 8 months ago

Download ijcai.org

In multiagent environments, forms of social learning such as teaching and imitation have been shown to aid the transfer of knowledge from experts to learners in reinforcement lear...

Bob Price, Craig Boutilier

claim paper

Read More »

172

click to vote

ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

PAC model-free reinforcement learning

16 years 7 months ago

Download cseweb.ucsd.edu

For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm--Delayed Q-Learning. We prove it is PAC, achieving near o...

Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...

claim paper

Read More »

« Prev « First page 17 / 102 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers