Search Sciweavers | Sciweavers

1799 search results - page 95 / 360

» Filtered Reinforcement Learning

197

click to vote

ICML
2006
IEEE

103views Machine Learning» more ICML 2006»

Using inaccurate models in reinforcement learning

16 years 8 months ago

Download ai.stanford.edu

In the model-based policy search approach to reinforcement learning (RL), policies are found using a model (or "simulator") of the Markov decision process. However, for ...

Pieter Abbeel, Morgan Quigley, Andrew Y. Ng

claim paper

Read More »

194

click to vote

ICML
2004
IEEE

145views Machine Learning» more ICML 2004»

Convergence of synchronous reinforcement learning with linear function approximation

16 years 8 months ago

Download www.machinelearning.org

Synchronous reinforcement learning (RL) algorithms with linear function approximation are representable as inhomogeneous matrix iterations of a special form (Schoknecht & Merk...

Artur Merke, Ralf Schoknecht

claim paper

Read More »

198

click to vote

ICML
2002
IEEE

146views Machine Learning» more ICML 2002»

Hierarchically Optimal Average Reward Reinforcement Learning

16 years 8 months ago

Download www.cs.ualberta.ca

Two notions of optimality have been explored in previous work on hierarchical reinforcement learning (HRL): hierarchical optimality, or the optimal policy in the space defined by ...

Mohammad Ghavamzadeh, Sridhar Mahadevan

claim paper

Read More »

213

click to vote

ISCA
2008
IEEE

137views Hardware» more ISCA 2008»

Self-Optimizing Memory Controllers: A Reinforcement Learning Approach

16 years 1 months ago

Download www.csl.cornell.edu

Eﬃciently utilizing oﬀ-chip DRAM bandwidth is a critical issue in designing cost-eﬀective, high-performance chip multiprocessors (CMPs). Conventional memory controllers deli...

Engin Ipek, Onur Mutlu, José F. Martí...

claim paper

Read More »

305

click to vote

ILP
2007
Springer

283views Automated Reasoning» more ILP 2007»

Building Relational World Models for Reinforcement Learning

16 years 1 months ago

Download ftp.cs.wisc.edu

Abstract. Many reinforcement learning domains are highly relational. While traditional temporal-difference methods can be applied to these domains, they are limited in their capaci...

Trevor Walker, Lisa Torrey, Jude W. Shavlik, Richa...

claim paper

Read More »

« Prev « First page 95 / 360 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers