Search Sciweavers | Sciweavers

181 search results - page 25 / 37

» On Policy Learning in Restricted Policy Spaces

click to vote

IROS
2008
IEEE

203views Robotics» more IROS 2008»

Learning equivalent action choices from demonstration

14 years 2 months ago

Download www.cs.cmu.edu

Abstract— In their interactions with the world robots inevitably face equivalent action choices, situations in which multiple actions are equivalently applicable. In this paper, ...

Sonia Chernova, Manuela M. Veloso

claim paper

Read More »

click to vote

GECCO
2000
Springer

143views Optimization» more GECCO 2000»

A Genetic Algorithm for Automatically Designing Modular Reinforcement Learning Agents

13 years 11 months ago

Download www.cs.bham.ac.uk

Reinforcement learning (RL) is one of the machine learning techniques and has been received much attention as a new self-adaptive controller for various systems. The RL agent auto...

Isao Ono, Tetsuo Nijo, Norihiko Ono

claim paper

Read More »

click to vote

ICML
2010
IEEE

219views Machine Learning» more ICML 2010»

Convergence of Least Squares Temporal Difference Methods Under General Conditions

13 years 8 months ago

Download www.cs.helsinki.fi

We consider approximate policy evaluation for finite state and action Markov decision processes (MDP) in the off-policy learning context and with the simulation-based least square...

Huizhen Yu

claim paper

Read More »

click to vote

AIPS
2007

174views Artificial Intelligence» more AIPS 2007»

Learning to Plan Using Harmonic Analysis of Diffusion Models

13 years 10 months ago

Download www.cs.umass.edu

This paper summarizes research on a new emerging framework for learning to plan using the Markov decision process model (MDP). In this paradigm, two approaches to learning to plan...

Sridhar Mahadevan, Sarah Osentoski, Jeffrey Johns,...

claim paper

Read More »

click to vote

NIPS
1993

86views Information Technology» more NIPS 1993»

Robust Reinforcement Learning in Motion Planning

13 years 9 months ago

Download www.cs.cmu.edu

While exploring to nd better solutions, an agent performing online reinforcement learning (RL) can perform worse than is acceptable. In some cases, exploration might have unsafe, ...

Satinder P. Singh, Andrew G. Barto, Roderic A. Gru...

claim paper

Read More »

« Prev « First page 25 / 37 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers