Search Sciweavers | Sciweavers

779 search results - page 21 / 156

» Reinforcement Using Supervised Learning for Policy Generaliz...

202

click to vote

ECML
2007
Springer

192views Machine Learning» more ECML 2007»

Policy Gradient Critics

16 years 1 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

claim paper

Read More »

200

click to vote

ICANNGA
2007
Springer

105views Algorithms» more ICANNGA 2007»

Reinforcement Learning in Fine Time Discretization

16 years 1 months ago

Download staff.elka.pw.edu.pl

Reinforcement Learning (RL) is analyzed here as a tool for control system optimization. State and action spaces are assumed to be continuous. Time is assumed to be discrete, yet th...

Pawel Wawrzynski

claim paper

Read More »

168

click to vote

FLAIRS
2006

103views Artificial Intelligence» more FLAIRS 2006»

Using Active Relocation to Aid Reinforcement Learning

15 years 8 months ago

Download www.cs.utexas.edu

We propose a new framework for aiding a reinforcement learner by allowing it to relocate, or move, to a state it selects so as to decrease the number of steps it needs to take in ...

Lilyana Mihalkova, Raymond J. Mooney

claim paper

Read More »

191

Voted

ICML
2006
IEEE

103views Machine Learning» more ICML 2006»

Using inaccurate models in reinforcement learning

16 years 8 months ago

Download ai.stanford.edu

In the model-based policy search approach to reinforcement learning (RL), policies are found using a model (or "simulator") of the Markov decision process. However, for ...

Pieter Abbeel, Morgan Quigley, Andrew Y. Ng

claim paper

Read More »

192

click to vote

MICAI
2009
Springer

188views Artificial Intelligence» more MICAI 2009»

A Two-Stage Relational Reinforcement Learning with Continuous Actions for Real Service Robots

16 years 1 months ago

Download ccc.inaoep.mx

Reinforcement Learning is a commonly used technique in robotics, however, traditional algorithms are unable to handle large amounts of data coming from the robot’s sensors, requi...

Julio H. Zaragoza, Eduardo F. Morales

claim paper

Read More »

« Prev « First page 21 / 156 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers