Search Sciweavers | Sciweavers

61 search results - page 10 / 13

» Convergence of synchronous reinforcement learning with linea...

182

click to vote

GECCO
2009
Springer

200views Optimization» more GECCO 2009»

Apply ant colony optimization to Tetris

16 years 1 months ago

Download cs.nju.edu.cn

Tetris is a falling block game where the player’s objective is to arrange a sequence of diﬀerent shaped tetrominoes smoothly in order to survive. In the intelligence games, ag...

Xingguo Chen, Hao Wang, Weiwei Wang, Yinghuan Shi,...

claim paper

Read More »

216

click to vote

CORR
2010
Springer

119views Education» more CORR 2010»

Dynamic Policy Programming

15 years 7 months ago

Download www.snn.ru.nl

In this paper, we consider the problem of planning and learning in the infinite-horizon discounted-reward Markov decision problems. We propose a novel iterative direct policysearc...

Mohammad Gheshlaghi Azar, Hilbert J. Kappen

claim paper

Read More »

227

click to vote

CORR
2010
Springer

204views Education» more CORR 2010»

Predictive State Temporal Difference Learning

15 years 5 months ago

Download www.cs.cmu.edu

We propose a new approach to value function approximation which combines linear temporal difference reinforcement learning with subspace identiﬁcation. In practical applications...

Byron Boots, Geoffrey J. Gordon

claim paper

Read More »

206

click to vote

NIPS
1998

164views Information Technology» more NIPS 1998»

Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms

15 years 8 months ago

Download www.cis.upenn.edu

In this paper, we address two issues of long-standing interest in the reinforcement learning literature. First, what kinds of performance guarantees can be made for Q-learning aft...

Michael J. Kearns, Satinder P. Singh

claim paper

Read More »

169

click to vote

IJCNN
2000
IEEE

117views Neural Networks» more IJCNN 2000»

Piecewise Linear Homeomorphisms: The Scalar Case

15 years 11 months ago

Download kodlab.seas.upenn.edu

The class of piecewise linear homeomorphisms (PLH) provides a convenient functional representation for many applications wherein an approximation to data is required that is inver...

Richard E. Groff, Daniel E. Koditschek, Pramod P. ...

claim paper

Read More »

« Prev « First page 10 / 13 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers