Search Sciweavers | Sciweavers

533 search results - page 21 / 107

» Playing games with approximation algorithms

click to vote

ML
2000
ACM

126views Machine Learning» more ML 2000»

Learning to Play Chess Using Temporal Differences

13 years 8 months ago

Download www.cs.princeton.edu

In this paper we present TDLEAF( ), a variation on the TD( ) algorithm that enables it to be used in conjunction with game-tree search. We present some experiments in which our che...

Jonathan Baxter, Andrew Tridgell, Lex Weaver

claim paper

Read More »

click to vote

COLT
2004
Springer

99views Machine Learning» more COLT 2004»

Reinforcement Learning for Average Reward Zero-Sum Games

14 years 2 months ago

Download www.ece.mcgill.ca

Abstract. We consider Reinforcement Learning for average reward zerosum stochastic games. We present and analyze two algorithms. The ﬁrst is based on relative Q-learning and the ...

Shie Mannor

claim paper

Read More »

click to vote

COCO
2010
Springer

133views Algorithms» more COCO 2010»

Spectral Algorithms for Unique Games

14 years 27 days ago

Download www.math.ias.edu

We present a new algorithm for Unique Games which is based on purely spectral techniques, in contrast to previous work in the area, which relies heavily on semideﬁnite programmi...

Alexandra Kolla

claim paper

Read More »

click to vote

IPCO
2010

125views Optimization» more IPCO 2010»

A Pumping Algorithm for Ergodic Stochastic Mean Payoff Games with Perfect Information

13 years 10 months ago

Download www.mpi-inf.mpg.de

Abstract. We consider two-person zero-sum stochastic mean payoff games with perfect information, or BWR-games, given by a digraph G = (V = VB VW VR, E), with local rewards r : E R...

Endre Boros, Khaled M. Elbassioni, Vladimir Gurvic...

claim paper

Read More »

click to vote

ICML
2010
IEEE

216views Machine Learning» more ICML 2010»

Multi-agent Learning Experiments on Repeated Matrix Games

13 years 10 months ago

Download www.math-info.univ-paris5.fr

This paper experimentally evaluates multiagent learning algorithms playing repeated matrix games to maximize their cumulative return. Previous works assessed that Qlearning surpas...

Bruno Bouzy, Marc Métivier

claim paper

Read More »

« Prev « First page 21 / 107 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers