Search Sciweavers | Sciweavers

1138 search results - page 160 / 228

» Feature Markov Decision Processes

157

Voted

ICML
2007
IEEE

200views Machine Learning» more ICML 2007»

Multi-task reinforcement learning: a hierarchical Bayesian approach

16 years 4 months ago

Download www.machinelearning.org

We consider the problem of multi-task reinforcement learning, where the agent needs to solve a sequence of Markov Decision Processes (MDPs) chosen randomly from a fixed but unknow...

Aaron Wilson, Alan Fern, Soumya Ray, Prasad Tadepa...

claim paper

Read More »

125

click to vote

ICML
2008
IEEE

122views Machine Learning» more ICML 2008»

Reinforcement learning in the presence of rare events

16 years 4 months ago

Download www.ece.mcgill.ca

We consider the task of reinforcement learning in an environment in which rare significant events occur independently of the actions selected by the controlling agent. If these ev...

Jordan Frank, Shie Mannor, Doina Precup

claim paper

Read More »

125

click to vote

ICML
2006
IEEE

103views Machine Learning» more ICML 2006»

Using inaccurate models in reinforcement learning

16 years 4 months ago

Download ai.stanford.edu

In the model-based policy search approach to reinforcement learning (RL), policies are found using a model (or "simulator") of the Markov decision process. However, for ...

Pieter Abbeel, Morgan Quigley, Andrew Y. Ng

claim paper

Read More »

141

click to vote

ICML
2006
IEEE

136views Machine Learning» more ICML 2006»

An analytic solution to discrete Bayesian reinforcement learning

16 years 4 months ago

Download www.cs.uwaterloo.ca

Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an online fashion as they interact with their environment. Existing RL algorithms co...

Pascal Poupart, Nikos A. Vlassis, Jesse Hoey, Kevi...

claim paper

Read More »

124

click to vote

ICML
2003
IEEE

121views Machine Learning» more ICML 2003»

Planning in the Presence of Cost Functions Controlled by an Adversary

16 years 4 months ago

Download www.cs.cmu.edu

We investigate methods for planning in a Markov Decision Process where the cost function is chosen by an adversary after we fix our policy. As a running example, we consider a rob...

H. Brendan McMahan, Geoffrey J. Gordon, Avrim Blum

claim paper

Read More »

« Prev « First page 160 / 228 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers