Search Sciweavers | Sciweavers

1630 search results - page 48 / 326

» Coordinated Reinforcement Learning

113

Voted

AAAI
1997

107views Intelligent Agents» more AAAI 1997»

Reinforcement Learning with Time

15 years 4 months ago

Download www.aaai.org

This paper steps back from the standard infinite horizon formulation of reinforcement learning problems to consider the simpler case of finite horizon problems. Although finite ho...

Daishi Harada

claim paper

Read More »

113

Voted

COLT
2008
Springer

132views Machine Learning» more COLT 2008»

Adaptive Aggregation for Reinforcement Learning with Efficient Exploration: Deterministic Domains

15 years 5 months ago

Download colt2008.cs.helsinki.fi

We propose a model-based learning algorithm, the Adaptive Aggregation Algorithm (AAA), that aims to solve the online, continuous state space reinforcement learning problem in a de...

Andrey Bernstein, Nahum Shimkin

claim paper

Read More »

136

click to vote

ATAL
2007
Springer

111views Intelligent Agents» more ATAL 2007»

IFSA: incremental feature-set augmentation for reinforcement learning tasks

15 years 9 months ago

Download userweb.cs.utexas.edu

Reinforcement learning is a popular and successful framework for many agent-related problems because only limited environmental feedback is necessary for learning. While many algo...

Mazda Ahmadi, Matthew E. Taylor, Peter Stone

claim paper

Read More »

131

click to vote

CG
2006
Springer

155views Computer Graphics» more CG 2006»

Feature Construction for Reinforcement Learning in Hearts

15 years 5 months ago

Download webdocs.cs.ualberta.ca

Temporal difference (TD) learning has been used to learn strong evaluation functions in a variety of two-player games. TD-gammon illustrated how the combination of game tree search...

Nathan R. Sturtevant, Adam M. White

claim paper

Read More »

156

Voted

NIPS
2001

131views Information Technology» more NIPS 2001»

The Steering Approach for Multi-Criteria Reinforcement Learning

15 years 4 months ago

Download books.nips.cc

We consider the problem of learning to attain multiple goals in a dynamic environment, which is initially unknown. In addition, the environment may contain arbitrarily varying ele...

Shie Mannor, Nahum Shimkin

claim paper

Read More »

« Prev « First page 48 / 326 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers