Sciweavers

458 search results - page 42 / 92
» Q-Decomposition for Reinforcement Learning Agents
CAEPIA
2011
Springer
12 years 9 months ago
Evaluating a Reinforcement Learning Algorithm with a General Intelligence Test
In this paper we apply the recent notion of anytime universal intelligence tests to the evaluation of a popular reinforcement learning algorithm, Q-learning. We show that a general...
Javier Insa-Cabrera, David L. Dowe, José He...
ICML
2004
IEEE
14 years 9 months ago
Adaptive cognitive orthotics: combining reinforcement learning and constraint-based temporal reasoning
Reminder systems support people with impaired prospective memory and/or executive function, by providing them with reminders of their functional daily activities. We integrate tem...
Matthew R. Rudary, Satinder P. Singh, Martha E. Po...
AAAI
2007
13 years 11 months ago
Active Imitation Learning
Imitation learning, also called learning by watching or programming by demonstration, has emerged as a means of accelerating many reinforcement learning tasks. Previous work has s...
Aaron P. Shon, Deepak Verma, Rajesh P. N. Rao
AAAI
2007
13 years 11 months ago
Temporal Difference and Policy Search Methods for Reinforcement Learning: An Empirical Comparison
Reinforcement learning (RL) methods have become popular in recent years because of their ability to solve complex tasks with minimal feedback. Both genetic algorithms (GAs) and te...
Matthew E. Taylor, Shimon Whiteson, Peter Stone
ATAL
2004
Springer
14 years 2 months ago
Bayesian Reinforcement Learning for Coalition Formation under Uncertainty
Research on coalition formation usually assumes the values of potential coalitions to be known with certainty. Furthermore, settings in which agents lack sufficient knowledge of ...
Georgios Chalkiadakis, Craig Boutilier