Sciweavers

779 search results - page 21 / 156
» Reinforcement Using Supervised Learning for Policy Generaliz...
Sort
View
ECML
2007
Springer
14 years 1 months ago
Policy Gradient Critics
We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...
Daan Wierstra, Jürgen Schmidhuber
ICANNGA
2007
Springer
105views Algorithms» more  ICANNGA 2007»
14 years 1 months ago
Reinforcement Learning in Fine Time Discretization
Reinforcement Learning (RL) is analyzed here as a tool for control system optimization. State and action spaces are assumed to be continuous. Time is assumed to be discrete, yet th...
Pawel Wawrzynski
FLAIRS
2006
13 years 9 months ago
Using Active Relocation to Aid Reinforcement Learning
We propose a new framework for aiding a reinforcement learner by allowing it to relocate, or move, to a state it selects so as to decrease the number of steps it needs to take in ...
Lilyana Mihalkova, Raymond J. Mooney
ICML
2006
IEEE
14 years 8 months ago
Using inaccurate models in reinforcement learning
In the model-based policy search approach to reinforcement learning (RL), policies are found using a model (or "simulator") of the Markov decision process. However, for ...
Pieter Abbeel, Morgan Quigley, Andrew Y. Ng
MICAI
2009
Springer
14 years 2 months ago
A Two-Stage Relational Reinforcement Learning with Continuous Actions for Real Service Robots
Reinforcement Learning is a commonly used technique in robotics, however, traditional algorithms are unable to handle large amounts of data coming from the robot’s sensors, requi...
Julio H. Zaragoza, Eduardo F. Morales