Search Sciweavers | Sciweavers

377 search results - page 30 / 76

» Optimizing Production Manufacturing Using Reinforcement Lear...

233

click to vote

WAPCV
2007
Springer

188views Computer Vision» more WAPCV 2007»

Reinforcement Learning for Decision Making in Sequential Visual Attention

16 years 27 days ago

Download www.mobvis.org

The innovation of this work is the provision of a system that learns visual encodings of attention patterns and that enables sequential attention for object detection in real world...

Lucas Paletta, Gerald Fritz

claim paper

Read More »

176

click to vote

NIPS
1998

137views Information Technology» more NIPS 1998»

Risk Sensitive Reinforcement Learning

15 years 8 months ago

Download www.cs.cmu.edu

In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states are those states entering which is undesirable or dangerous. We define the risk with re...

Ralph Neuneier, Oliver Mihatsch

claim paper

Read More »

187

click to vote

ICRA
2009
IEEE

227views Robotics» more ICRA 2009»

Adaptive autonomous control using online value iteration with gaussian processes

16 years 1 months ago

Download www-personal.acfr.usyd.edu.au

— In this paper, we present a novel approach to controlling a robotic system online from scratch based on the reinforcement learning principle. In contrast to other approaches, o...

Axel Rottmann, Wolfram Burgard

claim paper

Read More »

156

click to vote

NIPS
1996

117views Information Technology» more NIPS 1996»

Reinforcement Learning for Mixed Open-loop and Closed-loop Control

15 years 8 months ago

Download anytime.cs.umass.edu

Closed-loop control relies on sensory feedback that is usually assumed to be free. But if sensing incurs a cost, it may be coste ective to take sequences of actions in open-loop m...

Eric A. Hansen, Andrew G. Barto, Shlomo Zilberstei...

claim paper

Read More »

165

click to vote

ICML
2002
IEEE

127views Machine Learning» more ICML 2002»

Action Refinement in Reinforcement Learning by Probability Smoothing

16 years 7 months ago

Download www.cs.berkeley.edu

In many reinforcement learning applications, the set of possible actions can be partitioned by the programmer into subsets of similar actions. This paper presents a technique for ...

Carles Sierra, Dídac Busquets, Ramon L&oacu...

claim paper

Read More »

« Prev « First page 30 / 76 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers