Search Sciweavers | Sciweavers

171 search results - page 24 / 35

» Principled Methods for Advising Reinforcement Learning Agent...

153

click to vote

ATAL
2008
Springer

131views Intelligent Agents» more ATAL 2008»

A new perspective to the keepaway soccer: the takers

15 years 8 months ago

Download www.aamas-conference.org

Keepaway is a sub-problem of RoboCup Soccer Simulator in which 'the keepers' try to maintain the possession of the ball, while 'the takers' try to steal the ba...

Atil Iscen, Umut Erogul

claim paper

Read More »

203

click to vote

ICML
2003
IEEE

150views Machine Learning» more ICML 2003»

The Significance of Temporal-Difference Learning in Self-Play Training TD-Rummy versus EVO-rummy

15 years 11 months ago

Download www.hpl.hp.com

Reinforcement learning has been used for training game playing agents. The value function for a complex game must be approximated with a continuous function because the number of ...

Clifford Kotnik, Jugal K. Kalita

claim paper

Read More »

159

Voted

AAAI
2010

191views Intelligent Agents» more AAAI 2010»

Relative Entropy Policy Search

15 years 7 months ago

Download www.kyb.tuebingen.mpg.de

Policy search is a successful approach to reinforcement learning. However, policy improvements often result in the loss of information. Hence, it has been marred by premature conv...

Jan Peters, Katharina Mülling, Yasemin Altun

claim paper

Read More »

170

click to vote

AAAI
2004

120views Intelligent Agents» more AAAI 2004»

Making Better Recommendations with Online Profiling Agents

15 years 7 months ago

Download www.comp.nus.edu.sg

In recent years, we have witnessed the success of autonomous agents applying machine learning techniques across a wide range of applications. However, agents applying the same mac...

Danny Oh, Chew Lim Tan

claim paper

Read More »

184

click to vote

ICML
2000
IEEE

153views Machine Learning» more ICML 2000»

Eligibility Traces for Off-Policy Policy Evaluation

16 years 6 months ago

Download www.cs.ualberta.ca

Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...

Doina Precup, Richard S. Sutton, Satinder P. Singh

claim paper

Read More »

« Prev « First page 24 / 35 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers