Sciweavers

272 search results - page 15 / 55
» Parallel Reinforcement Learning with Linear Function Approxi...
Sort
View
AAAI
2008
13 years 9 months ago
Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation
Off-policy reinforcement learning is aimed at efficiently reusing data samples gathered in the past, which is an essential problem for physically grounded AI as experiments are us...
Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiya...
ECML
2006
Springer
13 years 11 months ago
Scaling Model-Based Average-Reward Reinforcement Learning for Product Delivery
Reinforcement learning in real-world domains suffers from three curses of dimensionality: explosions in state and action spaces, and high stochasticity. We present approaches that ...
Scott Proper, Prasad Tadepalli
IROS
2006
IEEE
190views Robotics» more  IROS 2006»
14 years 1 months ago
Q-RAN: A Constructive Reinforcement Learning Approach for Robot Behavior Learning
Abstract— This paper presents a learning system that uses Qlearning with a resource allocating network (RAN) for behavior learning in mobile robotics. The RAN is used as a functi...
Jun Li, Achim J. Lilienthal, Tomás Mart&iac...
ICML
2003
IEEE
14 years 8 months ago
Action Elimination and Stopping Conditions for Reinforcement Learning
We consider incorporating action elimination procedures in reinforcement learning algorithms. We suggest a framework that is based on learning an upper and a lower estimates of th...
Eyal Even-Dar, Shie Mannor, Yishay Mansour
IJCAI
2007
13 years 9 months ago
Reinforcement Learning of Local Shape in the Game of Go
We explore an application to the game of Go of a reinforcement learning approach based on a linear evaluation function and large numbers of binary features. This strategy has prov...
David Silver, Richard S. Sutton, Martin Mülle...