Search Sciweavers | Sciweavers

1512 search results - page 99 / 303

» Qualitative reinforcement learning

160

click to vote

ICML
2002
IEEE

146views Machine Learning» more ICML 2002»

Hierarchically Optimal Average Reward Reinforcement Learning

16 years 6 months ago

Download www.cs.ualberta.ca

Two notions of optimality have been explored in previous work on hierarchical reinforcement learning (HRL): hierarchical optimality, or the optimal policy in the space defined by ...

Mohammad Ghavamzadeh, Sridhar Mahadevan

claim paper

Read More »

170

click to vote

ISCA
2008
IEEE

137views Hardware» more ISCA 2008»

Self-Optimizing Memory Controllers: A Reinforcement Learning Approach

16 years 3 days ago

Download www.csl.cornell.edu

Eﬃciently utilizing oﬀ-chip DRAM bandwidth is a critical issue in designing cost-eﬀective, high-performance chip multiprocessors (CMPs). Conventional memory controllers deli...

Engin Ipek, Onur Mutlu, José F. Martí...

claim paper

Read More »

255

click to vote

ILP
2007
Springer

283views Automated Reasoning» more ILP 2007»

Building Relational World Models for Reinforcement Learning

15 years 12 months ago

Download ftp.cs.wisc.edu

Abstract. Many reinforcement learning domains are highly relational. While traditional temporal-difference methods can be applied to these domains, they are limited in their capaci...

Trevor Walker, Lisa Torrey, Jude W. Shavlik, Richa...

claim paper

Read More »

147

click to vote

AUSAI
2004
Springer

129views Artificial Intelligence» more AUSAI 2004»

A Dynamic Allocation Method of Basis Functions in Reinforcement Learning

15 years 11 months ago

Download www.cs.jhu.edu

In this paper, we propose a dynamic allocation method of basis functions, an Allocation/Elimination Gaussian Softmax Basis Function Network (AE-GSBFN), that is used in reinforcemen...

Shingo Iida, Kiyotake Kuwayama, Masayoshi Kanoh, S...

claim paper

Read More »

159

click to vote

ECML
2004
Springer

137views Machine Learning» more ECML 2004»

Analyzing Multi-agent Reinforcement Learning Using Evolutionary Dynamics

15 years 11 months ago

Download www.personeel.unimaas.nl

In this paper, we show how the dynamics of Q-learning can be visualized and analyzed from a perspective of Evolutionary Dynamics (ED). More speciﬁcally, we show how ED can be use...

Pieter Jan't Hoen, Karl Tuyls

claim paper

Read More »

« Prev « First page 99 / 303 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers