Search Sciweavers | Sciweavers

56 search results - page 7 / 12

» Reinforcement Learning for Average Reward Zero-Sum Games

150

click to vote

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 7 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

191

click to vote

AAMAS
2007
Springer

164views Intelligent Agents» more AAMAS 2007»

Networks of Learning Automata and Limiting Games

16 years 7 days ago

Download como.vub.ac.be

Learning Automata (LA) were recently shown to be valuable tools for designing Multi-Agent Reinforcement Learning algorithms. One of the principal contributions of LA theory is that...

Peter Vrancx, Katja Verbeeck, Ann Nowé

claim paper

Read More »

159

click to vote

ATAL
2007
Springer

128views Intelligent Agents» more ATAL 2007»

Advice taking in multiagent reinforcement learning

16 years 6 days ago

Download homepages.inf.ed.ac.uk

This paper proposes the β-WoLF algorithm for multiagent reinforcement learning (MARL) in the stochastic games framework that uses an additional “advice” signal to inform agen...

Michael Rovatsos, Alexandros Belesiotis

claim paper

Read More »

150

click to vote

CORR
2007
Springer

73views Education» more CORR 2007»

Universal Reinforcement Learning

15 years 6 months ago

Download www.stanford.edu

—We consider an agent interacting with an unmodeled environment. At each time, the agent makes an observation, takes an action, and incurs a cost. Its actions can inﬂuence futu...

Vivek F. Farias, Ciamac Cyrus Moallemi, Tsachy Wei...

claim paper

Read More »

151

Voted

ICML
2005
IEEE

137views Machine Learning» more ICML 2005»

Learning to compete, compromise, and cooperate in repeated general-sum games

16 years 6 months ago

Download www.mit.edu

Learning algorithms often obtain relatively low average payoffs in repeated general-sum games between other learning agents due to a focus on myopic best-response and one-shot Nas...

Jacob W. Crandall, Michael A. Goodrich

claim paper

Read More »

« Prev « First page 7 / 12 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers