Search Sciweavers | Sciweavers

61 search results - page 3 / 13

» Convergence of synchronous reinforcement learning with linea...

205

click to vote

CDC
2010
IEEE

160views Control Systems» more CDC 2010»

Adaptive bases for Q-learning

15 years 1 months ago

Download webee.technion.ac.il

Abstract-- We consider reinforcement learning, and in particular, the Q-learning algorithm in large state and action spaces. In order to cope with the size of the spaces, a functio...

Dotan Di Castro, Shie Mannor

claim paper

Read More »

215

click to vote

JMLR
2010

119views more JMLR 2010»

A Convergent Online Single Time Scale Actor Critic Algorithm

15 years 1 months ago

Download jmlr.csail.mit.edu

Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...

Dotan Di Castro, Ron Meir

claim paper

Read More »

218

click to vote

AAAI
2011

202views Intelligent Agents» more AAAI 2011»

Value Function Approximation in Reinforcement Learning Using the Fourier Basis

14 years 7 months ago

Download people.csail.mit.edu

We describe the Fourier Basis, a linear value function approximation scheme based on the Fourier Series. We empirically evaluate its properties, and demonstrate that it performs w...

George Konidaris, Sarah Osentoski, Philip Thomas

claim paper

Read More »

222

click to vote

NN
2010
Springer

187views Neural Networks» more NN 2010»

Efficient exploration through active learning for value function approximation in reinforcement learning

15 years 1 months ago

Download sugiyama-www.cs.titech.ac.jp

Appropriately designing sampling policies is highly important for obtaining better control policies in reinforcement learning. In this paper, we first show that the least-squares ...

Takayuki Akiyama, Hirotaka Hachiya, Masashi Sugiya...

claim paper

Read More »

173

click to vote

NIPS
1994

178views Information Technology» more NIPS 1994»

Generalization in Reinforcement Learning: Safely Approximating the Value Function

15 years 8 months ago

Download www.ri.cmu.edu

To appear in: G. Tesauro, D. S. Touretzky and T. K. Leen, eds., Advances in Neural Information Processing Systems 7, MIT Press, Cambridge MA, 1995. A straightforward approach to t...

Justin A. Boyan, Andrew W. Moore

claim paper

Read More »

« Prev « First page 3 / 13 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers