945 search results - page 142 / 189

» Dialog Convergence and Learning

COLT
1995
Springer

124 views, Machine Learning

A Comparison of New and Old Algorithms for a Mixture Estimation Problem

15 years 10 months ago

Download users.soe.ucsc.edu

We investigate the problem of estimating the proportion vector which maximizes the likelihood of a given sample for a mixture of given densities. We adapt a framework developed for...

David P. Helmbold, Yoram Singer, Robert E. Schapir...

ATAL
2008
Springer

124 views, Intelligent Agents

Social reward shaping in the prisoner's dilemma

15 years 9 months ago

Download www.aamas-conference.org

Reward shaping is a well-known technique applied to help reinforcement-learning agents converge more quickly to near-optimal behavior. In this paper, we introduce social reward sha...

Monica Babes, Enrique Munoz de Cote, Michael L. Li...

ECIS
2004

123 views, Information Technology

Open University vs. Consorzio Nettuno: an institutional analysis of two technology enabled higher educational systems

15 years 8 months ago

Download is2.lse.ac.uk

Assuming a rational perspective, the adoption and development of a new organisational technology can be viewed as a way to achieve a higher level of efficiency by finding the bes...

Flavia Blumetti, Paolo Ferri, Cristiano Ghiringhel...

NIPS
2003

207 views, Information Technology

Extending Q-Learning to General Adaptive Multi-Agent Systems

15 years 8 months ago

Download books.nips.cc

Recent multi-agent extensions of Q-Learning require knowledge of other agents’ payoffs and Q-functions, and assume game-theoretic play at all times by all other agents. This pap...

Gerald Tesauro

GECCO
2008
Springer

172 views, Optimization

Recursive least squares and quadratic prediction in continuous multistep problems

15 years 8 months ago

Download www.cs.bham.ac.uk

XCS with computed prediction, namely XCSF, has been recently extended in several ways. In particular, a novel prediction update algorithm based on recursive least squares and the ...

Daniele Loiacono, Pier Luca Lanzi
