Search Sciweavers | Sciweavers

93 search results - page 5 / 19

» Learning to overtake in TORCS using simple reinforcement lea...

153

click to vote

CG
2006
Springer

155views Computer Graphics» more CG 2006»

Feature Construction for Reinforcement Learning in Hearts

15 years 7 months ago

Download webdocs.cs.ualberta.ca

Temporal difference (TD) learning has been used to learn strong evaluation functions in a variety of two-player games. TD-gammon illustrated how the combination of game tree search...

Nathan R. Sturtevant, Adam M. White

claim paper

Read More »

135

click to vote

IROS
2007
IEEE

123views Robotics» more IROS 2007»

Reinforcement learning in multi-dimensional state-action space using random rectangular coarse coding and Gibbs sampling

15 years 11 months ago

Download sysplan.nams.kyushu-u.ac.jp

: This paper presents a coarse coding technique and an action selection scheme for reinforcement learning (RL) in multi-dimensional and continuous state-action spaces following con...

Kimura Kimura

claim paper

Read More »

142

click to vote

ECAI
2008
Springer

83views Artificial Intelligence» more ECAI 2008»

Reinforcement Learning with the Use of Costly Features

15 years 7 months ago

Download people.cs.kuleuven.be

In many practical reinforcement learning problems, the state space is too large to permit an exact representation of the value function, much less the time required to compute it. ...

Robby Goetschalckx, Scott Sanner, Kurt Driessens

claim paper

Read More »

156

click to vote

IDEAL
2007
Springer

127views Intelligent Agents» more IDEAL 2007»

Skill Combination for Reinforcement Learning

15 years 11 months ago

Download www.cs.qub.ac.uk

Recently researchers have introduced methods to develop reusable knowledge in reinforcement learning (RL). In this paper, we define simple principles to combine skills in reinforce...

Zhihui Luo, David A. Bell, Barry McCollum

claim paper

Read More »

156

click to vote

ICRA
2006
IEEE

131views Robotics» more ICRA 2006»

Using Reinforcement Learning to Improve Exploration Trajectories for Error Minimization

15 years 11 months ago

Download mapleleaf.csail.mit.edu

Abstract— The mapping and localization problems have received considerable attention in robotics recently. The exploration problem that drives mapping has started to generate sim...

Thomas Kollar, Nicholas Roy

claim paper

Read More »

« Prev « First page 5 / 19 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers