Sciweavers

36 search results - page 4 / 8
» Posterior Weighted Reinforcement Learning with State Uncerta...
Sort
View
CDC
2008
IEEE
142views Control Systems» more  CDC 2008»
14 years 2 months ago
Convergence of rule-of-thumb learning rules in social networks
— We study the problem of dynamic learning by a social network of agents. Each agent receives a signal about an underlying state and communicates with a subset of agents (his nei...
Daron Acemoglu, Angelia Nedic, Asuman E. Ozdaglar
IJCAI
2001
13 years 9 months ago
R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning
R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...
Ronen I. Brafman, Moshe Tennenholtz
ATAL
2009
Springer
14 years 2 months ago
Learning of coordination: exploiting sparse interactions in multiagent systems
Creating coordinated multiagent policies in environments with uncertainty is a challenging problem, which can be greatly simplified if the coordination needs are known to be limi...
Francisco S. Melo, Manuela M. Veloso
MICAI
2009
Springer
14 years 2 months ago
A Two-Stage Relational Reinforcement Learning with Continuous Actions for Real Service Robots
Reinforcement Learning is a commonly used technique in robotics, however, traditional algorithms are unable to handle large amounts of data coming from the robot’s sensors, requi...
Julio H. Zaragoza, Eduardo F. Morales
ICIP
2001
IEEE
14 years 9 months ago
Tracking of human activities using shape-encoded particle propagation
We present an approach to tracking human activities in a monocular video. We model the human body by decomposing it into torso and limbs and use simple 3D shapes to approximate th...
Hankyu Moon, Rama Chellappa, Azriel Rosenfeld