Search Sciweavers | Sciweavers

286 search results - page 10 / 58

» Using inaccurate models in reinforcement learning

EWRL
2008

Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case

13 years 9 months ago

Download webee.technion.ac.il

We consider reinforcement learning in the parameterized setup, where the model is known to belong to a parameterized family of Markov Decision Processes (MDPs). We further impose ...

Kirill Dyagilev, Shie Mannor, Nahum Shimkin

AROBOTS
1999

Reinforcement Learning Soccer Teams with Incomplete World Models

13 years 7 months ago

Download igitur-archive.library.uu.nl

We use reinforcement learning (RL) to compute strategies for multiagent soccer teams. RL may profit significantly from world models (WMs) estimating state transition probabilities an...

Marco Wiering, Rafal Salustowicz, Jürgen Schm...

ICML
2002
IEEE

Algorithm-Directed Exploration for Model-Based Reinforcement Learning in Factored MDPs

14 years 8 months ago

Download select.cs.cmu.edu

One of the central challenges in reinforcement learning is to balance the exploration/exploitation tradeoff while scaling up to large problems. Although model-based reinforcement ...

Carlos Guestrin, Relu Patrascu, Dale Schuurmans

ICML
2005
IEEE

Exploration and apprenticeship learning in reinforcement learning

14 years 8 months ago

Download ai.stanford.edu

We consider reinforcement learning in systems with unknown dynamics. Algorithms such as E3 (Kearns and Singh, 2002) learn near-optimal policies by using "exploration policies...

Pieter Abbeel, Andrew Y. Ng

NIPS
2003

Gaussian Processes in Reinforcement Learning

13 years 9 months ago

Download books.nips.cc

We exploit some useful properties of Gaussian process (GP) regression models for reinforcement learning in continuous state spaces and discrete time. We demonstrate how the GP mod...

Carl Edward Rasmussen, Malte Kuss

