Search Sciweavers | Sciweavers

18 search results - page 1 / 4

PRICAI 2000, Springer. Category: Artificial Intelligence.

Generating Hierarchical Structure in Reinforcement Learning from State Variables

15 years 9 months ago

Download www.csee.umbc.edu

This paper presents the CQ algorithm, which decomposes and solves a Markov Decision Process (MDP) by automatically generating a hierarchy of smaller MDPs using state variables. The ...

Bernhard Hengst

ICML 2002, IEEE. Category: Machine Learning.

Discovering Hierarchy in Reinforcement Learning with HEXQ

16 years 6 months ago

Download www.cs.berkeley.edu

An open problem in reinforcement learning is discovering hierarchical structure. HEXQ, an algorithm which automatically attempts to decompose and solve a model-free factored MDP h...

Bernhard Hengst

AR 2008

Efficient Behavior Learning Based on State Value Estimation of Self and Others

15 years 5 months ago

Download www.er.ams.eng.osaka-u.ac.jp

Existing reinforcement learning methods suffer severely from the curse of dimensionality, especially when they are applied to multiagent dynamic environments. ...

Yasutake Takahashi, Kentarou Noma, Minoru Asada

GECCO 2008, Springer. Category: Optimization.

Scaling ant colony optimization with hierarchical reinforcement learning partitioning

15 years 6 months ago

Download www.cs.bham.ac.uk

This paper merges hierarchical reinforcement learning (HRL) with ant colony optimization (ACO) to produce an HRL ACO algorithm capable of generating solutions for large domains. Th...

Erik J. Dries, Gilbert L. Peterson

ICML 2005, IEEE. Category: Machine Learning.

A causal approach to hierarchical decomposition of factored MDPs

16 years 6 months ago

Download www-anw.cs.umass.edu

We present Variable Influence Structure Analysis, an algorithm that dynamically performs hierarchical decomposition of factored Markov decision processes. Our algorithm determines...

Anders Jonsson, Andrew G. Barto
