Search Sciweavers | Sciweavers

227 search results - page 13 / 46

» Generalized multiagent learning with performance bound

178

click to vote

ATAL
2005
Springer

130views Intelligent Agents» more ATAL 2005»

Discovering strategic multi-agent behavior in a robotic soccer domain

16 years 4 days ago

Download www.cs.huji.ac.il

2. THE MASM ALGORITHM An input to the MASM algorithm is a time-annotated multi-agent action sequence. The action sequence is then transformed into an action graph. An action graph ...

Andraz Bezek

claim paper

Read More »

177

Voted

AAAI
2010

218views Intelligent Agents» more AAAI 2010»

Multi-Agent Plan Recognition: Formalization and Algorithms

15 years 8 months ago

Download orca.st.usm.edu

Multi-Agent Plan Recognition (MAPR) seeks to identify the dynamic team structures and team behaviors from the observations of the activity-sequences of a set of intelligent agents...

Bikramjit Banerjee, Landon Kraemer, Jeremy Lyle

claim paper

Read More »

190

click to vote

UAI
2003

172views Artificial Intelligence» more UAI 2003»

On the Convergence of Bound Optimization Algorithms

15 years 8 months ago

Download cs.nyu.edu

Many practitioners who use EM and related algorithms complain that they are sometimes slow. When does this happen, and what can be done about it? In this paper, we study the gener...

Ruslan Salakhutdinov, Sam T. Roweis, Zoubin Ghahra...

claim paper

Read More »

185

click to vote

ICML
2001
IEEE

164views Machine Learning» more ICML 2001»

Learning with the Set Covering Machine

16 years 7 months ago

Download www2.ift.ulaval.ca

We generalize the classical algorithms of Valiant and Haussler for learning conjunctions and disjunctions of Boolean attributes to the problem of learning these functions over arb...

Mario Marchand, John Shawe-Taylor

claim paper

Read More »

167

click to vote

ICML
2010
IEEE

167views Machine Learning» more ICML 2010»

Finite-Sample Analysis of LSTD

15 years 7 months ago

Download hal.inria.fr

In this paper we consider the problem of policy evaluation in reinforcement learning, i.e., learning the value function of a fixed policy, using the least-squares temporal-differe...

Alessandro Lazaric, Mohammad Ghavamzadeh, Ré...

claim paper

Read More »

« Prev « First page 13 / 46 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers