Search Sciweavers | Sciweavers

473 search results - page 27 / 95

» Optimal policy switching algorithms for reinforcement learni...

175

click to vote

ILP
2007
Springer

250views Automated Reasoning» more ILP 2007»

Learning Relational Options for Inductive Transfer in Relational Reinforcement Learning

16 years 3 days ago

Download people.cs.kuleuven.be

In reinforcement learning problems, an agent has the task of learning a good or optimal strategy from interaction with his environment. At the start of the learning task, the agent...

Tom Croonenborghs, Kurt Driessens, Maurice Bruynoo...

claim paper

Read More »

170

Voted

ICML
2001
IEEE

185views Machine Learning» more ICML 2001»

Off-Policy Temporal Difference Learning with Function Approximation

16 years 6 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

claim paper

Read More »

154

click to vote

ICML
2006
IEEE

103views Machine Learning» more ICML 2006»

Using inaccurate models in reinforcement learning

16 years 6 months ago

Download ai.stanford.edu

In the model-based policy search approach to reinforcement learning (RL), policies are found using a model (or "simulator") of the Markov decision process. However, for ...

Pieter Abbeel, Morgan Quigley, Andrew Y. Ng

claim paper

Read More »

147

click to vote

NIPS
2003

108views Information Technology» more NIPS 2003»

Policy Search by Dynamic Programming

15 years 7 months ago

Download books.nips.cc

We consider the policy search approach to reinforcement learning. We show that if a “baseline distribution” is given (indicating roughly how often we expect a good policy to v...

J. Andrew Bagnell, Sham Kakade, Andrew Y. Ng, Jeff...

claim paper

Read More »

155

click to vote

EURONGI
2005
Springer

115views Computer Networks» more EURONGI 2005»

An Afterstates Reinforcement Learning Approach to Optimize Admission Control in Mobile Cellular Networks

15 years 11 months ago

Download jogiguz.webs.upv.es

We deploy a novel Reinforcement Learning optimization technique based on afterstates learning to determine the gain that can be achieved by incorporating movement prediction inform...

José Manuel Giménez-Guzmán, J...

claim paper

Read More »

« Prev « First page 27 / 95 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers