Sciweavers

536 search results - page 32 / 108
» Residual Algorithms: Reinforcement Learning with Function Ap...
Sort
View
NIPS
2001
13 years 9 months ago
The Steering Approach for Multi-Criteria Reinforcement Learning
We consider the problem of learning to attain multiple goals in a dynamic environment, which is initially unknown. In addition, the environment may contain arbitrarily varying ele...
Shie Mannor, Nahum Shimkin
GECCO
2009
Springer
124views Optimization» more  GECCO 2009»
14 years 9 days ago
Reinforcement learning for games: failures and successes
We apply CMA-ES, an evolution strategy with covariance matrix adaptation, and TDL (Temporal Difference Learning) to reinforcement learning tasks. In both cases these algorithms se...
Wolfgang Konen, Thomas Bartz-Beielstein
ICASSP
2011
IEEE
12 years 11 months ago
Image prediction based on non-negative matrix factorization
This paper presents a novel spatial texture prediction method based on non-negative matrix factorization. As an extension of template matching, approximation based iterative textu...
Mehmet Türkan, Christine Guillemot
ROBOCUP
2007
Springer
102views Robotics» more  ROBOCUP 2007»
14 years 1 months ago
Heuristic Reinforcement Learning Applied to RoboCup Simulation Agents
This paper describes the design and implementation of robotic agents for the RoboCup Simulation 2D category that learns using a recently proposed Heuristic Reinforcement Learning a...
Luiz A. Celiberto, Carlos H. C. Ribeiro, Anna Hele...
ECML
2006
Springer
13 years 11 months ago
Scaling Model-Based Average-Reward Reinforcement Learning for Product Delivery
Reinforcement learning in real-world domains suffers from three curses of dimensionality: explosions in state and action spaces, and high stochasticity. We present approaches that ...
Scott Proper, Prasad Tadepalli