Search Sciweavers | Sciweavers

ML 2002, ACM, Machine Learning

Building a Basic Block Instruction Scheduler with Reinforcement Learning and Rollouts

The execution order of a block of computer instructions on a pipelined machine can make a difference in running time by a factor of two or more. Compilers use heuristic schedulers...

Amy McGovern, J. Eliot B. Moss, Andrew G. Barto

CIG 2006, IEEE, Applied Computing

Monte-Carlo Go Reinforcement Learning Experiments

This paper describes experiments using reinforcement learning techniques to compute pattern urgencies used during simulations performed in a Monte-Carlo Go architecture...

Bruno Bouzy, Guillaume Chaslot

ILP 2003, Springer, Automated Reasoning

Graph Kernels and Gaussian Processes for Relational Reinforcement Learning

RRL is a relational reinforcement learning system based on Q-learning in relational state-action spaces. It aims to enable agents to learn how to act in an environment that has no ...

Thomas Gärtner, Kurt Driessens, Jan Ramon

AI 2004, Springer, Artificial Intelligence

Multi-attribute Decision Making in a Complex Multiagent Environment Using Reinforcement Learning with Selective Perception

Choosing between multiple alternative tasks is a hard problem for agents evolving in an uncertain real-time multiagent environment. An example of such an environment is the ...

Sébastien Paquet, Nicolas Bernier, Brahim C...

ECML 2004, Springer, Machine Learning

Convergence and Divergence in Standard and Averaging Reinforcement Learning

Although tabular reinforcement learning (RL) methods have been proved to converge to an optimal policy, the combination of particular conventional reinforcement learning techniques...

Marco Wiering
