Sciweavers

3274 search results - page 189 / 655
» Using Learning in a Control Agent
ECML
2005
Springer
14 years 1 month ago
Using Rewards for Belief State Updates in Partially Observable Markov Decision Processes
Partially Observable Markov Decision Processes (POMDP) provide a standard framework for sequential decision making in stochastic environments. In this setting, an agent takes actio...
Masoumeh T. Izadi, Doina Precup
AIED
2007
Springer
14 years 2 months ago
Does Learner Control Affect Learning?
Many intelligent tutoring systems permit some degree of learner control. A natural question is whether the increased student engagement and motivation such control provides results...
Joseph E. Beck
ATAL
2003
Springer
14 years 1 month ago
How to calm hyperactive agents
System performance in multi-agent resource allocation systems can often improve if individual agents reduce their activity. Agents in such systems need a way to modulate their ind...
H. Van Dyke Parunak, Sven Brueckner, Robert S. Mat...
PKDD
2009
Springer
14 years 2 months ago
Active Learning for Reward Estimation in Inverse Reinforcement Learning
Inverse reinforcement learning addresses the general problem of recovering a reward function from samples of a policy provided by an expert/demonstrator. In this paper, w...
Manuel Lopes, Francisco S. Melo, Luis Montesano
GECCO
2007
Springer
14 years 2 months ago
Evolving controllers for simulated car racing using object oriented genetic programming
Several different controller representations are compared on a non-trivial problem in simulated car racing, with respect to learning speed and final fitness. The controller rep...
Alexandros Agapitos, Julian Togelius, Simon M. Luc...