Search Sciweavers | Sciweavers

132 search results - page 3 / 27

» Generalization in Reinforcement Learning: Safely Approximati...

AMAI
2004
Springer

133 views, Artificial Intelligence

Explicit Manifold Representations for Value-Function Approximation in Reinforcement Learning

16 years 13 days ago

Download rutcor.rutgers.edu

William D. Smart

ATAL
2009
Springer

135 views, Intelligent Agents

An empirical analysis of value function-based and policy search reinforcement learning

16 years 1 month ago

Download userweb.cs.utexas.edu

In several agent-oriented scenarios in the real world, an autonomous agent that is situated in an unknown environment must learn through a process of trial and error to take actio...

Shivaram Kalyanakrishnan, Peter Stone

ICRA
2009
IEEE

143 views, Robotics

Least absolute policy iteration for robust value function approximation

16 years 1 month ago

Download sugiyama-www.cs.titech.ac.jp

Least-squares policy iteration is a useful reinforcement learning method in robotics due to its computational efficiency. However, it tends to be sensitive to outliers...

Masashi Sugiyama, Hirotaka Hachiya, Hisashi Kashim...

ESANN
2008

164 views, Neural Networks

Multilayer Perceptrons with Radial Basis Functions as Value Functions in Reinforcement Learning

15 years 8 months ago

Download www.dice.ucl.ac.be

Using multilayer perceptrons (MLPs) to approximate the state-action value function in reinforcement learning (RL) algorithms could become a nightmare due to the constant possibilit...

Victor Uc Cetina

NIPS
2001

121 views, Information Technology

Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning

15 years 8 months ago

Download books.nips.cc

We address two open theoretical questions in Policy Gradient Reinforcement Learning. The first concerns the efficacy of using function approximation to represent the state action ...

Gregory Z. Grudic, Lyle H. Ungar
