Sciweavers

93 search results - page 11 / 19
» Learning to overtake in TORCS using simple reinforcement lea...
Sort
View
ATAL
2007
Springer
14 years 1 months ago
Batch reinforcement learning in a complex domain
Temporal difference reinforcement learning algorithms are perfectly suited to autonomous agents because they learn directly from an agent’s experience based on sequential actio...
Shivaram Kalyanakrishnan, Peter Stone
IJCAI
2001
13 years 9 months ago
Rational and Convergent Learning in Stochastic Games
This paper investigates the problem of policy learning in multiagent environments using the stochastic game framework, which we briefly overview. We introduce two properties as de...
Michael H. Bowling, Manuela M. Veloso
ICML
2002
IEEE
14 years 8 months ago
Algorithm-Directed Exploration for Model-Based Reinforcement Learning in Factored MDPs
One of the central challenges in reinforcement learning is to balance the exploration/exploitation tradeoff while scaling up to large problems. Although model-based reinforcement ...
Carlos Guestrin, Relu Patrascu, Dale Schuurmans
HEURISTICS
2008
170views more  HEURISTICS 2008»
13 years 7 months ago
Accelerating autonomous learning by using heuristic selection of actions
This paper investigates how to make improved action selection for online policy learning in robotic scenarios using reinforcement learning (RL) algorithms. Since finding control po...
Reinaldo A. C. Bianchi, Carlos H. C. Ribeiro, Anna...
ML
1998
ACM
136views Machine Learning» more  ML 1998»
13 years 7 months ago
Co-Evolution in the Successful Learning of Backgammon Strategy
Following Tesauro’s work on TD-Gammon, we used a 4000 parameter feed-forward neural network to develop a competitive backgammon evaluation function. Play proceeds by a roll of t...
Jordan B. Pollack, Alan D. Blair