Search Sciweavers | Sciweavers

54 search results - page 4 / 11

» Convergence Results for Single-Step On-Policy Reinforcement-...

221

click to vote

NIPS
2001

131views Information Technology» more NIPS 2001»

The Steering Approach for Multi-Criteria Reinforcement Learning

15 years 8 months ago

Download books.nips.cc

We consider the problem of learning to attain multiple goals in a dynamic environment, which is initially unknown. In addition, the environment may contain arbitrarily varying ele...

Shie Mannor, Nahum Shimkin

claim paper

Read More »

182

Voted

CORR
2007
Springer

73views Education» more CORR 2007»

Universal Reinforcement Learning

15 years 7 months ago

Download www.stanford.edu

—We consider an agent interacting with an unmodeled environment. At each time, the agent makes an observation, takes an action, and incurs a cost. Its actions can inﬂuence futu...

Vivek F. Farias, Ciamac Cyrus Moallemi, Tsachy Wei...

claim paper

Read More »

168

click to vote

AAMAS
2007
Springer

142views Intelligent Agents» more AAMAS 2007»

Parallel Reinforcement Learning with Linear Function Approximation

15 years 7 months ago

Download www.aamas-conference.org

In this paper, we investigate the use of parallelization in reinforcement learning (RL), with the goal of learning optimal policies for single-agent RL problems more quickly by us...

Matthew Grounds, Daniel Kudenko

claim paper

Read More »

222

click to vote

ATAL
2010
Springer

115views Intelligent Agents» more ATAL 2010»

Self-organization for coordinating decentralized reinforcement learning

15 years 8 months ago

Download www.cs.umass.edu

Decentralized reinforcement learning (DRL) has been applied to a number of distributed applications. However, one of the main challenges faced by DRL is its convergence. Previous ...

Chongjie Zhang, Victor R. Lesser, Sherief Abdallah

claim paper

Read More »

225

click to vote

JMLR
2010

119views more JMLR 2010»

A Convergent Online Single Time Scale Actor Critic Algorithm

15 years 2 months ago

Download jmlr.csail.mit.edu

Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...

Dotan Di Castro, Ron Meir

claim paper

Read More »

« Prev « First page 4 / 11 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers