Search Sciweavers | Sciweavers

128 search results - page 18 / 26

» Hierarchically Optimal Average Reward Reinforcement Learning

270

click to vote

CORR
2012
Springer

216views Education» more CORR 2012»

Fractional Moments on Bandit Problems

14 years 3 months ago

Download www.cse.iitm.ac.in

Reinforcement learning addresses the dilemma between exploration to ﬁnd profitable actions and exploitation to act according to the best observations already made. Bandit proble...

Ananda Narayanan B., Balaraman Ravindran

claim paper

Read More »

217

click to vote

CEC
2005
IEEE

98views Artificial Intelligence» more CEC 2005»

XCS with computed prediction in continuous multistep environments

15 years 9 months ago

Download www.eskimo.com

We apply XCS with computed prediction (XCSF) to tackle multistep reinforcement learning problems involving continuous inputs. In essence we use XCSF as a method of generalized rein...

Pier Luca Lanzi, Daniele Loiacono, Stewart W. Wils...

claim paper

Read More »

212

click to vote

ISCA
2008
IEEE

137views Hardware» more ISCA 2008»

Self-Optimizing Memory Controllers: A Reinforcement Learning Approach

16 years 1 months ago

Download www.csl.cornell.edu

Eﬃciently utilizing oﬀ-chip DRAM bandwidth is a critical issue in designing cost-eﬀective, high-performance chip multiprocessors (CMPs). Conventional memory controllers deli...

Engin Ipek, Onur Mutlu, José F. Martí...

claim paper

Read More »

251

click to vote

IEEEPACT
2008
IEEE

136views Distributed And Parallel Com...» more IEEEPACT 2008»

Feature selection and policy optimization for distributed instruction placement using reinforcement learning

16 years 1 months ago

Download userweb.cs.utexas.edu

Communication overheads are one of the fundamental challenges in a multiprocessor system. As the number of processors on a chip increases, communication overheads and the distribu...

Katherine E. Coons, Behnam Robatmili, Matthew E. T...

claim paper

Read More »

160

click to vote

AAAI
2008

141views Intelligent Agents» more AAAI 2008»

Economic Hierarchical Q-Learning

15 years 9 months ago

Download www.aaai.org

Hierarchical state decompositions address the curse-ofdimensionality in Q-learning methods for reinforcement learning (RL) but can suffer from suboptimality. In addressing this, w...

Erik G. Schultink, Ruggiero Cavallo, David C. Park...

claim paper

Read More »

« Prev « First page 18 / 26 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers