Search Sciweavers | Sciweavers

132 search results - page 5 / 27

» Generalization in Reinforcement Learning: Safely Approximati...

JMLR
2010

148 views

A Generalized Path Integral Control Approach to Reinforcement Learning

15 years 2 months ago

Download jmlr.csail.mit.edu

With the goal to generate more scalable algorithms with higher efficiency and fewer open parameters, reinforcement learning (RL) has recently moved towards combining classical tec...

Evangelos Theodorou, Jonas Buchli, Stefan Schaal

AAAI
2006

161 views, Intelligent Agents

Sample-Efficient Evolutionary Function Approximation for Reinforcement Learning

15 years 8 months ago

Download staff.science.uva.nl

Reinforcement learning problems are commonly tackled with temporal difference methods, which attempt to estimate the agent's optimal value function. In most real-world proble...

Shimon Whiteson, Peter Stone

ICML
2006
IEEE

256 views, Machine Learning

Automatic basis function construction for approximate dynamic programming and reinforcement learning

16 years 1 months ago

Download www.ece.mcgill.ca

We address the problem of automatically constructing basis functions for linear approximation of the value function of a Markov Decision Process (MDP). Our work builds on results ...

Philipp W. Keller, Shie Mannor, Doina Precup

AAAI
2008

207 views, Intelligent Agents

Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation

15 years 9 months ago

Download sugiyama-www.cs.titech.ac.jp

Off-policy reinforcement learning is aimed at efficiently reusing data samples gathered in the past, which is an essential problem for physically grounded AI as experiments are us...

Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiya...

ATAL
2005
Springer

181 views, Intelligent Agents

Improving reinforcement learning function approximators via neuroevolution

16 years 27 days ago

Download www.aaai.org

Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...

Shimon Whiteson
