Search Sciweavers | Sciweavers

2108 search results - page 64 / 422

» Tracking in Reinforcement Learning

ML
2002
ACM

114 views, Machine Learning

Building a Basic Block Instruction Scheduler with Reinforcement Learning and Rollouts

15 years 6 months ago

Download www.cs.ou.edu

The execution order of a block of computer instructions on a pipelined machine can make a difference in running time by a factor of two or more. Compilers use heuristic schedulers...

Amy McGovern, J. Eliot B. Moss, Andrew G. Barto

CIG
2006
IEEE

190 views, Applied Computing

Monte-Carlo Go Reinforcement Learning Experiments

16 years 1 month ago

Download www.math-info.univ-paris5.fr

This paper describes experiments using reinforcement learning techniques to compute pattern urgencies used during simulations performed in a Monte-Carlo Go architecture...

Bruno Bouzy, Guillaume Chaslot

ILP
2003
Springer

126 views, Automated Reasoning

Graph Kernels and Gaussian Processes for Relational Reinforcement Learning

16 years 13 days ago

Download dtai.cs.kuleuven.be

RRL is a relational reinforcement learning system based on Q-learning in relational state-action spaces. It aims to enable agents to learn how to act in an environment that has no ...

Thomas Gärtner, Kurt Driessens, Jan Ramon

ECAI
2008
Springer

83 views, Artificial Intelligence

Reinforcement Learning with the Use of Costly Features

15 years 9 months ago

Download people.cs.kuleuven.be

In many practical reinforcement learning problems, the state space is too large to permit an exact representation of the value function, much less the time required to compute it. ...

Robby Goetschalckx, Scott Sanner, Kurt Driessens

NIPS
1996

117 views, Information Technology

Reinforcement Learning for Mixed Open-loop and Closed-loop Control

15 years 8 months ago

Download anytime.cs.umass.edu

Closed-loop control relies on sensory feedback that is usually assumed to be free. But if sensing incurs a cost, it may be cost-effective to take sequences of actions in open-loop m...

Eric A. Hansen, Andrew G. Barto, Shlomo Zilberstei...
