Sciweavers

4544 search results - page 147 / 909
» Reinforcement Learning with Time
Sort
View
ICAART
2011
INSTICC
13 years 1 months ago
Optimal Sample Selection for Batch-mode Reinforcement Learning
Emmanuel Rachelson, François Schnitzler, Lo...
SASO
2009
IEEE
14 years 4 months ago
Distributed W-Learning: Multi-Policy Optimization in Self-Organizing Systems
—Large-scale agent-based systems are required to self-optimize towards multiple, potentially conflicting, policies of varying spatial and temporal scope. As a result, not all ag...
Ivana Dusparic, Vinny Cahill
ICRA
2008
IEEE
128views Robotics» more  ICRA 2008»
14 years 4 months ago
Learning from human teachers with Socially Guided Exploration
— We present a learning mechanism, Socially Guided Exploration, in which a robot learns new tasks through a combination of self-exploration and social interaction. The system’s...
Cynthia Breazeal, Andrea Lockerd Thomaz
ATAL
2009
Springer
14 years 4 months ago
Solving multiagent assignment Markov decision processes
We consider the setting of multiple collaborative agents trying to complete a set of tasks as assigned by a centralized controller. We propose a scalable method called“Assignmen...
Scott Proper, Prasad Tadepalli