Search Sciweavers | Sciweavers

181 search results - page 21 / 37

» State Space Reduction For Hierarchical Reinforcement Learnin...

146

click to vote

ISDA
2009
IEEE

144views Operating System» more ISDA 2009»

Postponed Updates for Temporal-Difference Reinforcement Learning

15 years 11 months ago

Download www.science.uva.nl

This paper presents postponed updates, a new strategy for TD methods that can improve sample efﬁciency without incurring the computational and space requirements of model-based ...

Harm van Seijen, Shimon Whiteson

claim paper

Read More »

126

click to vote

ICML
2003
IEEE

121views Machine Learning» more ICML 2003»

Q-Decomposition for Reinforcement Learning Agents

16 years 5 months ago

Download www.hpl.hp.com

The paper explores a very simple agent design method called Q-decomposition, wherein a complex agent is built from simpler subagents. Each subagent has its own reward function and...

Stuart J. Russell, Andrew Zimdars

claim paper

Read More »

140

click to vote

IDEAL
2007
Springer

127views Intelligent Agents» more IDEAL 2007»

Skill Combination for Reinforcement Learning

15 years 10 months ago

Download www.cs.qub.ac.uk

Recently researchers have introduced methods to develop reusable knowledge in reinforcement learning (RL). In this paper, we define simple principles to combine skills in reinforce...

Zhihui Luo, David A. Bell, Barry McCollum

claim paper

Read More »

132

click to vote

APN
2003
Springer

142views Artificial Intelligence» more APN 2003»

Model Checking Safety Properties in Modular High-Level Nets

15 years 9 months ago

Download www.cis.hut.fi

Model checking by exhaustive state space enumeration is one of the most developed analysis methods for distributed event systems. Its main problem—the size of the state spaces—...

Marko Mäkelä

claim paper

Read More »

131

click to vote

ICML
1999
IEEE

129views Machine Learning» more ICML 1999»

Implicit Imitation in Multiagent Reinforcement Learning

16 years 5 months ago

Download www.cs.toronto.edu

Imitation is actively being studied as an effective means of learning in multi-agent environments. It allows an agent to learn how to act well (perhaps optimally) by passively obs...

Bob Price, Craig Boutilier

claim paper

Read More »

« Prev « First page 21 / 37 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers