Search Sciweavers | Sciweavers

704 search results - page 72 / 141

» Learning the Ideal Evaluation Function

158

click to vote

ATAL
2008
Springer

110views Intelligent Agents» more ATAL 2008»

A few good agents: multi-agent social learning

15 years 6 months ago

Download www.cs.cmu.edu

In this paper, we investigate multi-agent learning (MAL) in a multi-agent resource selection problem (MARS) in which a large group of agents are competing for common resources. Si...

Jean Oh, Stephen F. Smith

claim paper

Read More »

152

click to vote

NIPS
2008

110views Information Technology» more NIPS 2008»

Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms

15 years 5 months ago

Download groups.csail.mit.edu

Policy gradient (PG) reinforcement learning algorithms have strong (local) convergence guarantees, but their learning performance is typically limited by a large variance in the e...

John W. Roberts, Russ Tedrake

claim paper

Read More »

144

click to vote

CIKM
2010
Springer

171views Information Technology» more CIKM 2010»

Online learning for recency search ranking using real-time user feedback

15 years 3 months ago

Download www.research.rutgers.edu

Traditional machine-learned ranking algorithms for web search are trained in batch mode, which assume static relevance of documents for a given query. Although such a batch-learni...

Taesup Moon, Lihong Li, Wei Chu, Ciya Liao, Zhaohu...

claim paper

Read More »

174

click to vote

PAMI
2007

217views more PAMI 2007»

Discriminative Learning and Recognition of Image Set Classes Using Canonical Correlations

15 years 4 months ago

Download www.iis.ee.ic.ac.uk

—We address the problem of comparing sets of images for object recognition, where the sets may represent variations in an object’s appearance due to changing camera pose and li...

Tae-Kyun Kim, Josef Kittler, Roberto Cipolla

claim paper

Read More »

111

click to vote

IJCAI
2003

149views Artificial Intelligence» more IJCAI 2003»

Constructing utility models from observed negotiation actions

15 years 5 months ago

Download dli.iiit.ac.in

We propose a novel method for constructing utility models by learning from observed negotiation actions. In particular, we show how offers and counter-offers in negotiation can be...

Angelo C. Restificar, Peter Haddawy

claim paper

Read More »

« Prev « First page 72 / 141 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers