Sciweavers

1262 search results - page 150 / 253
» Reinforcement Learning: An Introduction
Sort
View
ICML
2005
IEEE
16 years 4 months ago
Learning to compete, compromise, and cooperate in repeated general-sum games
Learning algorithms often obtain relatively low average payoffs in repeated general-sum games between other learning agents due to a focus on myopic best-response and one-shot Nas...
Jacob W. Crandall, Michael A. Goodrich
134
Voted
AAAI
2007
15 years 5 months ago
Active Imitation Learning
Imitation learning, also called learning by watching or programming by demonstration, has emerged as a means of accelerating many reinforcement learning tasks. Previous work has s...
Aaron P. Shon, Deepak Verma, Rajesh P. N. Rao
ICML
2001
IEEE
16 years 4 months ago
Off-Policy Temporal Difference Learning with Function Approximation
We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...
Doina Precup, Richard S. Sutton, Sanjoy Dasgupta
ATAL
2004
Springer
15 years 8 months ago
Best-Response Multiagent Learning in Non-Stationary Environments
This paper investigates a relatively new direction in Multiagent Reinforcement Learning. Most multiagent learning techniques focus on Nash equilibria as elements of both the learn...
Michael Weinberg, Jeffrey S. Rosenschein
ICML
1994
IEEE
15 years 6 months ago
Learning Without State-Estimation in Partially Observable Markovian Decision Processes
Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning control architectures for embedded agents. Unfortunately all of the theory and much ...
Satinder P. Singh, Tommi Jaakkola, Michael I. Jord...