Sciweavers

4544 search results - page 189 / 909
» Reinforcement Learning with Time
Sort
View
ML
1998
ACM
136views Machine Learning» more  ML 1998»
13 years 10 months ago
Co-Evolution in the Successful Learning of Backgammon Strategy
Following Tesauro’s work on TD-Gammon, we used a 4000 parameter feed-forward neural network to develop a competitive backgammon evaluation function. Play proceeds by a roll of t...
Jordan B. Pollack, Alan D. Blair
MAGS
2010
81views more  MAGS 2010»
13 years 5 months ago
Task allocation learning in a multiagent environment: Application to the RoboCupRescue simulation
Coordinating agents in a complex environment is a hard problem, but it can become even harder when certain characteristics of the tasks, like the required number of agents, are un...
Sébastien Paquet, Brahim Chaib-draa, Patric...
ICML
2002
IEEE
14 years 11 months ago
Learning from Scarce Experience
Searching the space of policies directly for the optimal policy has been one popular method for solving partially observable reinforcement learning problems. Typically, with each ...
Leonid Peshkin, Christian R. Shelton
FBIT
2007
IEEE
14 years 4 months ago
Learning to Drive a Real Car in 20 Minutes
The paper describes our first experiments on Reinforcement Learning to steer a real robot car. The applied method, Neural Fitted Q Iteration (NFQ) is purely data-driven based on ...
Martin Riedmiller, Michael Montemerlo, Hendrik Dah...
ROMAN
2007
IEEE
150views Robotics» more  ROMAN 2007»
14 years 4 months ago
Asymmetric Interpretations of Positive and Negative Human Feedback for a Social Learning Agent
— The ability for people to interact with robots and teach them new skills will be crucial to the successful application of robots in everyday human environments. In order to des...
Andrea Lockerd Thomaz, Cynthia Breazeal