Sciweavers

4544 search results - page 17 / 909
» Reinforcement Learning with Time
ML
2002
ACM
13 years 7 months ago
Building a Basic Block Instruction Scheduler with Reinforcement Learning and Rollouts
The execution order of a block of computer instructions on a pipelined machine can make a difference in running time by a factor of two or more. Compilers use heuristic schedulers...
Amy McGovern, J. Eliot B. Moss, Andrew G. Barto
ECAI
2008
Springer
13 years 9 months ago
Reinforcement Learning with the Use of Costly Features
In many practical reinforcement learning problems, the state space is too large to permit an exact representation of the value function, much less the time required to compute it. ...
Robby Goetschalckx, Scott Sanner, Kurt Driessens
ATAL
2004
Springer
14 years 27 days ago
Resource Allocation in the Grid Using Reinforcement Learning
One of the main challenges in Grid computing is efficient allocation of resources (CPU-hours, network bandwidth, etc.) to the tasks submitted by users. Due to the lack of centrali...
Aram Galstyan, Karl Czajkowski, Kristina Lerman
AI
1998
Springer
13 years 7 months ago
Model-Based Average Reward Reinforcement Learning
Reinforcement Learning (RL) is the study of programs that improve their performance by receiving rewards and punishments from the environment. Most RL methods optimize the discoun...
Prasad Tadepalli, DoKyeong Ok
CEEMAS
2003
Springer
14 years 22 days ago
On a Dynamical Analysis of Reinforcement Learning in Games: Emergence of Occam's Razor
Modeling learning agents in the context of Multi-agent Systems requires an adequate understanding of their dynamic behaviour. Usually, these agents are modeled similar to the diff...
Karl Tuyls, Katja Verbeeck, Sam Maes