Search Sciweavers | Sciweavers

36 search results - page 4 / 8

» Posterior Weighted Reinforcement Learning with State Uncerta...

CDC
2008
IEEE

142 views | Control Systems

Convergence of rule-of-thumb learning rules in social networks

14 years 2 months ago

Download web.mit.edu

We study the problem of dynamic learning by a social network of agents. Each agent receives a signal about an underlying state and communicates with a subset of agents (his nei...

Daron Acemoglu, Angelia Nedic, Asuman E. Ozdaglar

IJCAI
2001

151 views | Artificial Intelligence

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

13 years 9 months ago

Download jmlr.csail.mit.edu

R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...

Ronen I. Brafman, Moshe Tennenholtz

ATAL
2009
Springer

150 views | Intelligent Agents

Learning of coordination: exploiting sparse interactions in multiagent systems

14 years 2 months ago

Download www.cs.cmu.edu

Creating coordinated multiagent policies in environments with uncertainty is a challenging problem, which can be greatly simplified if the coordination needs are known to be limi...

Francisco S. Melo, Manuela M. Veloso

MICAI
2009
Springer

188 views | Artificial Intelligence

A Two-Stage Relational Reinforcement Learning with Continuous Actions for Real Service Robots

14 years 2 months ago

Download ccc.inaoep.mx

Reinforcement Learning is a commonly used technique in robotics; however, traditional algorithms are unable to handle large amounts of data coming from the robot’s sensors, requi...

Julio H. Zaragoza, Eduardo F. Morales

ICIP
2001
IEEE

222 views | Image Processing

Tracking of human activities using shape-encoded particle propagation

14 years 9 months ago

Download www.umiacs.umd.edu

We present an approach to tracking human activities in a monocular video. We model the human body by decomposing it into torso and limbs and use simple 3D shapes to approximate th...

Hankyu Moon, Rama Chellappa, Azriel Rosenfeld
