Search Sciweavers | Sciweavers

536 search results - page 32 / 108

» Residual Algorithms: Reinforcement Learning with Function Ap...

212

click to vote

NIPS
2001

131views Information Technology» more NIPS 2001»

The Steering Approach for Multi-Criteria Reinforcement Learning

15 years 8 months ago

Download books.nips.cc

We consider the problem of learning to attain multiple goals in a dynamic environment, which is initially unknown. In addition, the environment may contain arbitrarily varying ele...

Shie Mannor, Nahum Shimkin

claim paper

Read More »

210

click to vote

GECCO
2009
Springer

124views Optimization» more GECCO 2009»

Reinforcement learning for games: failures and successes

15 years 11 months ago

Download www.gm.fh-koeln.de

We apply CMA-ES, an evolution strategy with covariance matrix adaptation, and TDL (Temporal Difference Learning) to reinforcement learning tasks. In both cases these algorithms se...

Wolfgang Konen, Thomas Bartz-Beielstein

claim paper

Read More »

186

click to vote

ICASSP
2011
IEEE

155views Signal Processing» more ICASSP 2011»

Image prediction based on non-negative matrix factorization

14 years 10 months ago

Download mirlab.org

This paper presents a novel spatial texture prediction method based on non-negative matrix factorization. As an extension of template matching, approximation based iterative textu...

Mehmet Türkan, Christine Guillemot

claim paper

Read More »

166

click to vote

ROBOCUP
2007
Springer

102views Robotics» more ROBOCUP 2007»

Heuristic Reinforcement Learning Applied to RoboCup Simulation Agents

16 years 1 months ago

Download www.fei.edu.br

This paper describes the design and implementation of robotic agents for the RoboCup Simulation 2D category that learns using a recently proposed Heuristic Reinforcement Learning a...

Luiz A. Celiberto, Carlos H. C. Ribeiro, Anna Hele...

claim paper

Read More »

197

click to vote

ECML
2006
Springer

116views Machine Learning» more ECML 2006»

Scaling Model-Based Average-Reward Reinforcement Learning for Product Delivery

15 years 10 months ago

Download web.engr.oregonstate.edu

Reinforcement learning in real-world domains suffers from three curses of dimensionality: explosions in state and action spaces, and high stochasticity. We present approaches that ...

Scott Proper, Prasad Tadepalli

claim paper

Read More »

« Prev « First page 32 / 108 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers