Search Sciweavers | Sciweavers

272 search results - page 20 / 55

» Parallel Reinforcement Learning with Linear Function Approxi...

179

click to vote

CG
2006
Springer

155views Computer Graphics» more CG 2006»

Feature Construction for Reinforcement Learning in Hearts

15 years 8 months ago

Download webdocs.cs.ualberta.ca

Temporal difference (TD) learning has been used to learn strong evaluation functions in a variety of two-player games. TD-gammon illustrated how the combination of game tree search...

Nathan R. Sturtevant, Adam M. White

claim paper

Read More »

221

click to vote

ICML
2003
IEEE

150views Machine Learning» more ICML 2003»

The Significance of Temporal-Difference Learning in Self-Play Training TD-Rummy versus EVO-rummy

15 years 12 months ago

Download www.hpl.hp.com

Reinforcement learning has been used for training game playing agents. The value function for a complex game must be approximated with a continuous function because the number of ...

Clifford Kotnik, Jugal K. Kalita

claim paper

Read More »

184

Voted

ATAL
2009
Springer

135views Intelligent Agents» more ATAL 2009»

An empirical analysis of value function-based and policy search reinforcement learning

16 years 1 months ago

Download userweb.cs.utexas.edu

In several agent-oriented scenarios in the real world, an autonomous agent that is situated in an unknown environment must learn through a process of trial and error to take actio...

Shivaram Kalyanakrishnan, Peter Stone

claim paper

Read More »

157

click to vote

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 8 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

147

click to vote

ICML
2008
IEEE

166views Machine Learning» more ICML 2008»

ManifoldBoost: stagewise function approximation for fully-, semi- and un-supervised learning

16 years 7 months ago

Download reason.cs.uiuc.edu

We introduce a boosting framework to solve a classification problem with added manifold and ambient regularization costs. It allows for a natural extension of boosting into both s...

Nicolas Loeff, David A. Forsyth, Deepak Ramachandr...

claim paper

Read More »

« Prev « First page 20 / 55 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers