Search Sciweavers | Sciweavers

1236 search results - page 65 / 248

» Opposition-Based Reinforcement Learning

NIPS
1998

Scheduling Straight-Line Code Using Reinforcement Learning and Rollouts

The execution order of a block of computer instructions can make a difference in its running time by a factor of two or more. In order to achieve the best possible speed, compiler...

Amy McGovern, J. Eliot B. Moss

ICML
2004
IEEE

Dynamic Abstraction in Reinforcement Learning via Clustering

Shie Mannor (shie@mit.edu), Laboratory for Information and Decision Systems, Massachusetts Institute of Technology, Cambridge, MA ...

Shie Mannor, Ishai Menache, Amit Hoze, Uri Klein

ICML
2002
IEEE

Discovering Hierarchy in Reinforcement Learning with HEXQ

An open problem in reinforcement learning is discovering hierarchical structure. HEXQ, an algorithm which automatically attempts to decompose and solve a model-free factored MDP h...

Bernhard Hengst

ECML
2006
Springer

Reinforcement Learning for MDPs with Constraints

In this article, I will consider Markov Decision Processes with two criteria, each defined as the expected value of an infinite horizon cumulative return. The second criterion is e...

Peter Geibel

ECML
2004
Springer

Batch Reinforcement Learning with State Importance

We investigate the problem of using function approximation in reinforcement learning where the agent’s policy is represented as a classifier mapping states to actions....

Lihong Li, Vadim Bulitko, Russell Greiner
