Search Sciweavers | Sciweavers

1236 search results - page 60 / 248

» Opposition-Based Reinforcement Learning

165

click to vote

CG
2006
Springer

155views Computer Graphics» more CG 2006»

Feature Construction for Reinforcement Learning in Hearts

15 years 8 months ago

Download webdocs.cs.ualberta.ca

Temporal difference (TD) learning has been used to learn strong evaluation functions in a variety of two-player games. TD-gammon illustrated how the combination of game tree search...

Nathan R. Sturtevant, Adam M. White

claim paper

Read More »

186

click to vote

NIPS
2001

131views Information Technology» more NIPS 2001»

The Steering Approach for Multi-Criteria Reinforcement Learning

15 years 7 months ago

Download books.nips.cc

We consider the problem of learning to attain multiple goals in a dynamic environment, which is initially unknown. In addition, the environment may contain arbitrarily varying ele...

Shie Mannor, Nahum Shimkin

claim paper

Read More »

153

click to vote

ICRA
2008
IEEE

113views Robotics» more ICRA 2008»

Reinforcement learning with function approximation for cooperative navigation tasks

16 years 22 days ago

Download gaips.inesc-id.pt

— In this paper, we propose a reinforcement learning approach to address multi-robot cooperative navigation tasks in inﬁnite settings. We propose an algorithm to simultaneously...

Francisco S. Melo, M. Isabel Ribeiro

claim paper

Read More »

182

click to vote

ATAL
2007
Springer

181views Intelligent Agents» more ATAL 2007»

Multiagent reinforcement learning and self-organization in a network of agents

16 years 14 days ago

Download mas.cs.umass.edu

To cope with large scale, agents are usually organized in a network such that an agent interacts only with its immediate neighbors in the network. Reinforcement learning technique...

Sherief Abdallah, Victor R. Lesser

claim paper

Read More »

173

click to vote

ATAL
2008
Springer

133views Intelligent Agents» more ATAL 2008»

Transfer of task representation in reinforcement learning using policy-based proto-value functions

15 years 8 months ago

Download www.aamas-conference.org

Reinforcement Learning research is traditionally devoted to solve single-task problems. Therefore, anytime a new task is faced, learning must be restarted from scratch. Recently, ...

Eliseo Ferrante, Alessandro Lazaric, Marcello Rest...

claim paper

Read More »

« Prev « First page 60 / 248 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers