Search Sciweavers | Sciweavers

215 search results - page 11 / 43

» Model-Based Reinforcement Learning with Continuous States an...

SCIA
2005
Springer

211 views, Image Analysis

Perception-Action Based Object Detection from Local Descriptor Combination and Reinforcement Learning

15 years 11 months ago

Download www.mobvis.org

This work proposes to learn visual encodings of attention patterns that enable sequential attention for object detection in real-world environments. The system embeds a saccadic d...

Lucas Paletta, Gerald Fritz, Christin Seifert

ICMLA
2010

205 views, Machine Learning

Incremental Learning of Relational Action Rules

15 years 3 months ago

Download www-lipn.univ-paris13.fr

In the relational reinforcement learning framework, we propose an algorithm that learns an action model able to predict the resulting state of each action in any give...

Christophe Rodrigues, Pierre Gérard, Cé...

ICML
2002
IEEE

156 views, Machine Learning

Algorithm-Directed Exploration for Model-Based Reinforcement Learning in Factored MDPs

16 years 6 months ago

Download select.cs.cmu.edu

One of the central challenges in reinforcement learning is to balance the exploration/exploitation tradeoff while scaling up to large problems. Although model-based reinforcement ...

Carlos Guestrin, Relu Patrascu, Dale Schuurmans

ROBOCUP
2007
Springer

167 views, Robotics

Cooperative/Competitive Behavior Acquisition Based on State Value Estimation of Others

15 years 12 months ago

Download www.er.ams.eng.osaka-u.ac.jp

Existing reinforcement learning approaches suffer from the curse of dimensionality when they are applied to multiagent dynamic environments. One of the typical...

Kentarou Noma, Yasutake Takahashi, Minoru Asada

ECML
2007
Springer

192 views, Machine Learning

Policy Gradient Critics

15 years 12 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber
