Search Sciweavers | Sciweavers

92 search results - page 10 / 19

» A General Convergence Method for Reinforcement Learning in t...

click to vote

GECCO
2008
Springer

182views Optimization» more GECCO 2008»

Scaling ant colony optimization with hierarchical reinforcement learning partitioning

13 years 7 months ago

Download www.cs.bham.ac.uk

This paper merges hierarchical reinforcement learning (HRL) with ant colony optimization (ACO) to produce a HRL ACO algorithm capable of generating solutions for large domains. Th...

Erik J. Dries, Gilbert L. Peterson

claim paper

Read More »

click to vote

ATAL
2007
Springer

151views Intelligent Agents» more ATAL 2007»

Batch reinforcement learning in a complex domain

14 years 27 days ago

Download userweb.cs.utexas.edu

Temporal diﬀerence reinforcement learning algorithms are perfectly suited to autonomous agents because they learn directly from an agent’s experience based on sequential actio...

Shivaram Kalyanakrishnan, Peter Stone

claim paper

Read More »

click to vote

ATAL
2009
Springer

146views Intelligent Agents» more ATAL 2009»

Online exploration in least-squares policy iteration

14 years 1 months ago

Download www.aamas-conference.org

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

claim paper

Read More »

click to vote

ICML
2009
IEEE

142views Machine Learning» more ICML 2009»

Curriculum learning

14 years 7 months ago

Download snowbird.djvuzone.org

Humans and animals learn much better when the examples are not randomly presented but organized in a meaningful order which illustrates gradually more concepts, and gradually more ...

Jérôme Louradour, Jason Weston, Ronan...

claim paper

Read More »

click to vote

CCGRID
2008
IEEE

127views Distributed And Parallel Com...» more CCGRID 2008»

Grid Differentiated Services: A Reinforcement Learning Approach

14 years 1 months ago

Download hal.inria.fr

—Large scale production grids are a major case for autonomic computing. Following the classical deﬁnition of Kephart, an autonomic computing system should optimize its own beha...

Julien Perez, Cécile Germain-Renaud, Bal&aa...

claim paper

Read More »

« Prev « First page 10 / 19 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers