Search Sciweavers | Sciweavers

340 search results - page 46 / 68

» Kernelized value function approximation for reinforcement le...

121

click to vote

FBIT
2007
IEEE

142views Information Technology» more FBIT 2007»

Learning to Drive a Real Car in 20 Minutes

15 years 8 months ago

Download www.ni.uos.de

The paper describes our ﬁrst experiments on Reinforcement Learning to steer a real robot car. The applied method, Neural Fitted Q Iteration (NFQ) is purely data-driven based on ...

Martin Riedmiller, Michael Montemerlo, Hendrik Dah...

claim paper

Read More »

click to vote

COLT
2008
Springer

139views Machine Learning» more COLT 2008»

Dimension and Margin Bounds for Reflection-invariant Kernels

15 years 4 months ago

Download colt2008.cs.helsinki.fi

A kernel over the Boolean domain is said to be reflection-invariant, if its value does not change when we flip the same bit in both arguments. (Many popular kernels have this prop...

Thorsten Doliwa, Michael Kallweit, Hans-Ulrich Sim...

claim paper

Read More »

108

click to vote

ICML
2002
IEEE

113views Machine Learning» more ICML 2002»

Learning from Scarce Experience

16 years 3 months ago

Download www.cs.ucr.edu

Searching the space of policies directly for the optimal policy has been one popular method for solving partially observable reinforcement learning problems. Typically, with each ...

Leonid Peshkin, Christian R. Shelton

claim paper

Read More »

108

click to vote

GECCO
2009
Springer

200views Optimization» more GECCO 2009»

Apply ant colony optimization to Tetris

15 years 9 months ago

Download cs.nju.edu.cn

Tetris is a falling block game where the player’s objective is to arrange a sequence of diﬀerent shaped tetrominoes smoothly in order to survive. In the intelligence games, ag...

Xingguo Chen, Hao Wang, Weiwei Wang, Yinghuan Shi,...

claim paper

Read More »

112

Voted

GECCO
2009
Springer

82views Optimization» more GECCO 2009»

On the scalability of XCS(F)

15 years 9 months ago

Download www.coboslab.psychologie.uni-wuerzburg.de

Many successful applications have proven the potential of Learning Classiﬁer Systems and the XCS classiﬁer system in particular in datamining, reinforcement learning, and func...

Patrick O. Stalph, Martin V. Butz, David E. Goldbe...

claim paper

Read More »

« Prev « First page 46 / 68 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers