Search Sciweavers | Sciweavers

34 search results - page 4 / 7

» Towards Finite-Sample Convergence of Direct Reinforcement Le...

258

click to vote

WAPCV
2007
Springer

188views Computer Vision» more WAPCV 2007»

Reinforcement Learning for Decision Making in Sequential Visual Attention

16 years 1 months ago

Download www.mobvis.org

The innovation of this work is the provision of a system that learns visual encodings of attention patterns and that enables sequential attention for object detection in real world...

Lucas Paletta, Gerald Fritz

claim paper

Read More »

250

click to vote

ECML
2007
Springer

167views Machine Learning» more ECML 2007»

Efficient Continuous-Time Reinforcement Learning with Adaptive State Graphs

15 years 11 months ago

Download www.igi.tugraz.at

Abstract. We present a new reinforcement learning approach for deterministic continuous control problems in environments with unknown, arbitrary reward functions. The difficulty of...

Gerhard Neumann, Michael Pfeiffer, Wolfgang Maass

claim paper

Read More »

216

click to vote

GECCO
2004
Springer

155views Optimization» more GECCO 2004»

Genetic Network Programming with Reinforcement Learning and Its Performance Evaluation

16 years 29 days ago

Download www.cs.york.ac.uk

A new graph-based evolutionary algorithm named “Genetic Network Programming, GNP” has been proposed. GNP represents its solutions as directed graph structures, which can improv...

Shingo Mabu, Kotaro Hirasawa, Jinglu Hu

claim paper

Read More »

204

click to vote

NECO
2007

150views more NECO 2007»

Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule

15 years 7 months ago

Download eprints.pascal-network.org

Learning agents, whether natural or artiﬁcial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...

Dorit Baras, Ron Meir

claim paper

Read More »

224

click to vote

UAI
2003

172views Artificial Intelligence» more UAI 2003»

On the Convergence of Bound Optimization Algorithms

15 years 9 months ago

Download cs.nyu.edu

Many practitioners who use EM and related algorithms complain that they are sometimes slow. When does this happen, and what can be done about it? In this paper, we study the gener...

Ruslan Salakhutdinov, Sam T. Roweis, Zoubin Ghahra...

claim paper

Read More »

« Prev « First page 4 / 7 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers