Search Sciweavers | Sciweavers

28 search results - page 4 / 6

» Multi-agent Learning Experiments on Repeated Matrix Games

207

Voted

COLT
2003
Springer

141views Machine Learning» more COLT 2003»

On-Line Learning with Imperfect Monitoring

16 years 11 days ago

Download www.ece.mcgill.ca

We study on-line play of repeated matrix games in which the observations of past actions of the other player and the obtained reward are partial and stochastic. We deﬁne the Part...

Shie Mannor, Nahum Shimkin

claim paper

Read More »

176

click to vote

ICML
2001
IEEE

127views Machine Learning» more ICML 2001»

Convergence of Gradient Dynamics with a Variable Learning Rate

16 years 8 months ago

Download www.cs.cmu.edu

As multiagent environments become more prevalent we need to understand how this changes the agent-based paradigm. One aspect that is heavily affected by the presence of multiple a...

Michael H. Bowling, Manuela M. Veloso

claim paper

Read More »

166

click to vote

AAAI
1996

118views Intelligent Agents» more AAAI 1996»

Learning Models of Intelligent Agents

15 years 8 months ago

Download www.cs.technion.ac.il

Agents that operate in a multi-agent system need an efficient strategy to handle their encounters with other agents involved. Searching for an optimal interactive strategy is a ha...

David Carmel, Shaul Markovitch

claim paper

Read More »

191

click to vote

IJCAI
2007

262views Artificial Intelligence» more IJCAI 2007»

Emergence of Norms through Social Learning

15 years 8 months ago

Download www.ijcai.org

Behavioral norms are key ingredients that allow agent coordination where societal laws do not sufﬁciently constrain agent behaviors. Whereas social laws need to be enforced in a...

Sandip Sen, Stéphane Airiau

claim paper

Read More »

145

click to vote

ICRA
2010
IEEE

149views Robotics» more ICRA 2010»

A simple learning strategy for high-speed quadrocopter multi-flips

15 years 5 months ago

Download www.idsc.ethz.ch

— We describe a simple and intuitive policy gradient method for improving parametrized quadrocopter multi-ﬂips by combining iterative experiments with information from a ﬁrst...

Sergei Lupashin, Angela Schöllig, Michael She...

claim paper

Read More »

« Prev « First page 4 / 6 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers