Search Sciweavers | Sciweavers

4544 search results - page 72 / 909

» Reinforcement Learning with Time

178

click to vote

ATAL
2006
Springer

177views Intelligent Agents» more ATAL 2006»

Convergence analysis for collective vocabulary development

15 years 10 months ago

Download www.isrl.illinois.edu

We study how decentralized agents can develop a shared vocabulary without global coordination. Answering this question can help us understand the emergence of many communication s...

Jun Wang, Les Gasser, Jim Houk

claim paper

Read More »

140

click to vote

JACIII
2006

97views more JACIII 2006»

Opposition-Based Reinforcement Learning

15 years 6 months ago

Download gauss.uwaterloo.ca

In this paper a method for image segmentation using an opposition-based reinforcement learning scheme is introduced. We use this agent-based approach to optimally find the appropri...

Hamid R. Tizhoosh

claim paper

Read More »

116

click to vote

ICML
2004
IEEE

146views Machine Learning» more ICML 2004»

Dynamic abstraction in reinforcement learning via clustering

16 years 7 months ago

Download rlai.cs.ualberta.ca

Abstraction in Reinforcement Learning via Clustering Shie Mannor shie@mit.edu Laboratory for Information and Decision Systems, Massachusetts Institute of Technology, Cambridge, MA ...

Shie Mannor, Ishai Menache, Amit Hoze, Uri Klein

claim paper

Read More »

140

click to vote

ICML
2002
IEEE

155views Machine Learning» more ICML 2002»

Discovering Hierarchy in Reinforcement Learning with HEXQ

16 years 7 months ago

Download www.cs.berkeley.edu

An open problem in reinforcement learning is discovering hierarchical structure. HEXQ, an algorithm which automatically attempts to decompose and solve a model-free factored MDP h...

Bernhard Hengst

claim paper

Read More »

146

click to vote

ECML
2006
Springer

88views Machine Learning» more ECML 2006»

Reinforcement Learning for MDPs with Constraints

15 years 8 months ago

Download www.peter-geibel.de

In this article, I will consider Markov Decision Processes with two criteria, each defined as the expected value of an infinite horizon cumulative return. The second criterion is e...

Peter Geibel

claim paper

Read More »

« Prev « First page 72 / 909 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers