Sciweavers

1236 search results - page 56 / 248
» Opposition-Based Reinforcement Learning
Sort
View
ECML
2006
Springer
14 years 1 months ago
Scaling Model-Based Average-Reward Reinforcement Learning for Product Delivery
Reinforcement learning in real-world domains suffers from three curses of dimensionality: explosions in state and action spaces, and high stochasticity. We present approaches that ...
Scott Proper, Prasad Tadepalli
ML
1998
ACM
136views Machine Learning» more  ML 1998»
13 years 9 months ago
Co-Evolution in the Successful Learning of Backgammon Strategy
Following Tesauro’s work on TD-Gammon, we used a 4000 parameter feed-forward neural network to develop a competitive backgammon evaluation function. Play proceeds by a roll of t...
Jordan B. Pollack, Alan D. Blair
MAGS
2010
81views more  MAGS 2010»
13 years 4 months ago
Task allocation learning in a multiagent environment: Application to the RoboCupRescue simulation
Coordinating agents in a complex environment is a hard problem, but it can become even harder when certain characteristics of the tasks, like the required number of agents, are un...
Sébastien Paquet, Brahim Chaib-draa, Patric...
PRIMA
2009
Springer
14 years 4 months ago
Recursive Adaptation of Stepsize Parameter for Non-stationary Environments
In this article, we propose a method to adapt stepsize parameters used in reinforcement learning for dynamic environments. In general reinforcement learning situations, a stepsize...
Itsuki Noda
ACL
2010
13 years 8 months ago
Reading between the Lines: Learning to Map High-Level Instructions to Commands
In this paper, we address the task of mapping high-level instructions to sequences of commands in an external environment. Processing these instructions is challenging--they posit...
S. R. K. Branavan, Luke S. Zettlemoyer, Regina Bar...