Sciweavers

945 search results - page 142 / 189
» Dialog Convergence and Learning
Sort
View
COLT
1995
Springer
14 years 1 months ago
A Comparison of New and Old Algorithms for a Mixture Estimation Problem
We investigate the problem of estimating the proportion vector which maximizes the likelihood of a given sample for a mixture of given densities. We adapt a framework developed for...
David P. Helmbold, Yoram Singer, Robert E. Schapir...
ATAL
2008
Springer
13 years 12 months ago
Social reward shaping in the prisoner's dilemma
Reward shaping is a well-known technique applied to help reinforcement-learning agents converge more quickly to nearoptimal behavior. In this paper, we introduce social reward sha...
Monica Babes, Enrique Munoz de Cote, Michael L. Li...
ECIS
2004
13 years 11 months ago
Open University vs. Consorzio Nettuno: an institutional analysis of two techonology enabled higher educational systems
Assuming a rational perspective, the adoption and development of a new organisational technology can be viewed as a way to achieve an higher level of efficiency by finding the bes...
Flavia Blumetti, Paolo Ferri, Cristiano Ghiringhel...
NIPS
2003
13 years 11 months ago
Extending Q-Learning to General Adaptive Multi-Agent Systems
Recent multi-agent extensions of Q-Learning require knowledge of other agents’ payoffs and Q-functions, and assume game-theoretic play at all times by all other agents. This pap...
Gerald Tesauro
GECCO
2008
Springer
172views Optimization» more  GECCO 2008»
13 years 11 months ago
Recursive least squares and quadratic prediction in continuous multistep problems
XCS with computed prediction, namely XCSF, has been recently extended in several ways. In particular, a novel prediction update algorithm based on recursive least squares and the ...
Daniele Loiacono, Pier Luca Lanzi