Search Sciweavers | Sciweavers

132 search results - page 17 / 27

» Generalization in Reinforcement Learning: Safely Approximati...

133

Voted

ICML
2002
IEEE

146views Machine Learning» more ICML 2002»

Hierarchically Optimal Average Reward Reinforcement Learning

16 years 4 months ago

Download www.cs.ualberta.ca

Two notions of optimality have been explored in previous work on hierarchical reinforcement learning (HRL): hierarchical optimality, or the optimal policy in the space defined by ...

Mohammad Ghavamzadeh, Sridhar Mahadevan

claim paper

Read More »

165

click to vote

IWLCS
2005
Springer

161views Machine Learning» more IWLCS 2005»

Counter Example for Q-Bucket-Brigade Under Prediction Problem

15 years 9 months ago

Download www.cs.bham.ac.uk

Aiming to clarify the convergence or divergence conditions for Learning Classiﬁer System (LCS), this paper explores: (1) an extreme condition where the reinforcement process of ...

Atsushi Wada, Keiki Takadama, Katsunori Shimohara

claim paper

Read More »

125

click to vote

ICML
2010
IEEE

167views Machine Learning» more ICML 2010»

Finite-Sample Analysis of LSTD

15 years 5 months ago

Download hal.inria.fr

In this paper we consider the problem of policy evaluation in reinforcement learning, i.e., learning the value function of a fixed policy, using the least-squares temporal-differe...

Alessandro Lazaric, Mohammad Ghavamzadeh, Ré...

claim paper

Read More »

125

click to vote

NIPS
1997

121views Information Technology» more NIPS 1997»

Generalized Prioritized Sweeping

15 years 5 months ago

Download www.cs.huji.ac.il

Prioritized sweeping is a model-based reinforcement learning method that attempts to focus an agent’s limited computational resources to achieve a good estimate of the value of ...

David Andre, Nir Friedman, Ronald Parr

claim paper

Read More »

140

Voted

ILP
2003
Springer

126views Automated Reasoning» more ILP 2003»

Graph Kernels and Gaussian Processes for Relational Reinforcement Learning

15 years 9 months ago

Download dtai.cs.kuleuven.be

RRL is a relational reinforcement learning system based on Q-learning in relational state-action spaces. It aims to enable agents to learn how to act in an environment that has no ...

Thomas Gärtner, Kurt Driessens, Jan Ramon

claim paper

Read More »

« Prev « First page 17 / 27 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers