Search Sciweavers | Sciweavers

47 search results - page 7 / 10

» Model-Based Reinforcement Learning in a Complex Domain

click to vote

ICML
2003
IEEE

121views Machine Learning» more ICML 2003»

Q-Decomposition for Reinforcement Learning Agents

14 years 8 months ago

Download www.hpl.hp.com

The paper explores a very simple agent design method called Q-decomposition, wherein a complex agent is built from simpler subagents. Each subagent has its own reward function and...

Stuart J. Russell, Andrew Zimdars

claim paper

Read More »

click to vote

UAI
2008

236views Artificial Intelligence» more UAI 2008»

CORL: A Continuous-state Offset-dynamics Reinforcement Learner

13 years 9 months ago

Download uai2008.cs.helsinki.fi

Continuous state spaces and stochastic, switching dynamics characterize a number of rich, realworld domains, such as robot navigation across varying terrain. We describe a reinfor...

Emma Brunskill, Bethany R. Leffler, Lihong Li, Mic...

claim paper

Read More »

click to vote

ICCS
1993
Springer

99views Applied Computing» more ICCS 1993»

Towards Domain-Independent Machine Intelligence

13 years 11 months ago

Download www.soe.ucsc.edu

Adaptive predictive search (APS), is a learning system framework, which given little initial domain knowledge, increases its decision-making abilities in complex problems domains....

Robert Levinson

claim paper

Read More »

click to vote

AGENTS
1999
Springer

105views Security Privacy» more AGENTS 1999»

Team-Partitioned, Opaque-Transition Reinforcement Learning

13 years 11 months ago

Download www.cs.ucf.edu

In this paper, we present a novel multi-agent learning paradigm called team-partitioned, opaque-transition reinforcement learning (TPOT-RL). TPOT-RL introduces the concept of usin...

Peter Stone, Manuela M. Veloso

claim paper

Read More »

click to vote

ICML
2003
IEEE

151views Machine Learning» more ICML 2003»

Hierarchical Policy Gradient Algorithms

14 years 8 months ago

Download www.hpl.hp.com

Hierarchical reinforcement learning is a general framework which attempts to accelerate policy learning in large domains. On the other hand, policy gradient reinforcement learning...

Mohammad Ghavamzadeh, Sridhar Mahadevan

claim paper

Read More »

« Prev « First page 7 / 10 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers