Search Sciweavers | Sciweavers

124 search results - page 5 / 25

» Basis function construction for hierarchical reinforcement l...

click to vote

IWANN
1999
Springer

115views Neural Networks» more IWANN 1999»

Using Temporal Neighborhoods to Adapt Function Approximators in Reinforcement Learning

13 years 12 months ago

Download www.cs.colostate.edu

To avoid the curse of dimensionality, function approximators are used in reinforcement learning to learn value functions for individual states. In order to make better use of comp...

R. Matthew Kretchmar, Charles W. Anderson

claim paper

Read More »

click to vote

PKDD
2009
Springer

144views Data Mining» more PKDD 2009»

Compositional Models for Reinforcement Learning

14 years 2 months ago

Download userweb.cs.utexas.edu

Abstract. Innovations such as optimistic exploration, function approximation, and hierarchical decomposition have helped scale reinforcement learning to more complex environments, ...

Nicholas K. Jong, Peter Stone

claim paper

Read More »

click to vote

CG
2006
Springer

155views Computer Graphics» more CG 2006»

Feature Construction for Reinforcement Learning in Hearts

13 years 9 months ago

Download webdocs.cs.ualberta.ca

Temporal difference (TD) learning has been used to learn strong evaluation functions in a variety of two-player games. TD-gammon illustrated how the combination of game tree search...

Nathan R. Sturtevant, Adam M. White

claim paper

Read More »

click to vote

ICML
2002
IEEE

146views Machine Learning» more ICML 2002»

Hierarchically Optimal Average Reward Reinforcement Learning

14 years 8 months ago

Download www.cs.ualberta.ca

Two notions of optimality have been explored in previous work on hierarchical reinforcement learning (HRL): hierarchical optimality, or the optimal policy in the space defined by ...

Mohammad Ghavamzadeh, Sridhar Mahadevan

claim paper

Read More »

click to vote

ATAL
2008
Springer

133views Intelligent Agents» more ATAL 2008»

Transfer of task representation in reinforcement learning using policy-based proto-value functions

13 years 9 months ago

Download www.aamas-conference.org

Reinforcement Learning research is traditionally devoted to solve single-task problems. Therefore, anytime a new task is faced, learning must be restarted from scratch. Recently, ...

Eliseo Ferrante, Alessandro Lazaric, Marcello Rest...

claim paper

Read More »

« Prev « First page 5 / 25 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers