Search Sciweavers | Sciweavers

2217 search results - page 202 / 444

» Learning from Collective Behavior

125

Voted

IJCAI
2003

169views Artificial Intelligence» more IJCAI 2003»

Covariant Policy Search

15 years 3 months ago

Download www.ri.cmu.edu

We investigate the problem of non-covariant behavior of policy gradient reinforcement learning algorithms. The policy gradient approach is amenable to analysis by information geom...

J. Andrew Bagnell, Jeff G. Schneider

claim paper

Read More »

112

click to vote

TSMC
2002

98views more TSMC 2002»

The STAR automaton: expediency and optimality properties

15 years 2 months ago

Download www.conta.uom.gr

Abstract--We present the STack ARchitecture (STAR) automaton. It is a fixed structure, multiaction, reward-penalty learning automaton, characterized by a star-shaped state transiti...

Anastasios A. Economides, Athanasios Kehagias

claim paper

Read More »

106

click to vote

ICDM
2007
IEEE

132views Data Mining» more ICDM 2007»

Learning What Makes a Society Tick

15 years 8 months ago

Download www.cs.rpi.edu

We present a machine learning methodology (models, algorithms, and experimental data) to discovering the agent dynamics that drive the evolution of the social groups in a communit...

Hung-Ching Chen, Mark K. Goldberg, Malik Magdon-Is...

claim paper

Read More »

112

Voted

EMMCVPR
2005
Springer

143views Computer Vision» more EMMCVPR 2005»

Exploiting Inference for Approximate Parameter Learning in Discriminative Fields: An Empirical Study

15 years 8 months ago

Download www.cs.cmu.edu

Abstract. Estimation of parameters of random ﬁeld models from labeled training data is crucial for their good performance in many image analysis applications. In this paper, we p...

Sanjiv Kumar, Jonas August, Martial Hebert

claim paper

Read More »

118

click to vote

ECAI
2006
Springer

194views Artificial Intelligence» more ECAI 2006»

Strategic Foresighted Learning in Competitive Multi-Agent Games

15 years 6 months ago

Download homepages.cwi.nl

We describe a generalized Q-learning type algorithm for reinforcement learning in competitive multi-agent games. We make the observation that in a competitive setting with adaptive...

Pieter Jan't Hoen, Sander M. Bohte, Han La Poutr&e...

claim paper

Read More »

« Prev « First page 202 / 444 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers