Search Sciweavers | Sciweavers

1233 search results - page 21 / 247

» Reinforcement Learning in MirrorBot

click to vote

CG
2006
Springer

155views Computer Graphics» more CG 2006»

Feature Construction for Reinforcement Learning in Hearts

14 years 26 days ago

Download webdocs.cs.ualberta.ca

Temporal difference (TD) learning has been used to learn strong evaluation functions in a variety of two-player games. TD-gammon illustrated how the combination of game tree search...

Nathan R. Sturtevant, Adam M. White

claim paper

Read More »

click to vote

NIPS
2001

131views Information Technology» more NIPS 2001»

The Steering Approach for Multi-Criteria Reinforcement Learning

14 years 8 days ago

Download books.nips.cc

We consider the problem of learning to attain multiple goals in a dynamic environment, which is initially unknown. In addition, the environment may contain arbitrarily varying ele...

Shie Mannor, Nahum Shimkin

claim paper

Read More »

click to vote

ATAL
2008
Springer

133views Intelligent Agents» more ATAL 2008»

Transfer of task representation in reinforcement learning using policy-based proto-value functions

14 years 28 days ago

Download www.aamas-conference.org

Reinforcement Learning research is traditionally devoted to solve single-task problems. Therefore, anytime a new task is faced, learning must be restarted from scratch. Recently, ...

Eliseo Ferrante, Alessandro Lazaric, Marcello Rest...

claim paper

Read More »

click to vote

Publication

240views

Bayesian multitask inverse reinforcement learning

12 years 9 months ago

Download arxiv.org

We generalise the problem of inverse reinforcement learning to multiple tasks, from multiple demonstrations. Each one may represent one expert trying to solve a different task, or ...

Christos Dimitrakakis, Constantin A. Rothkopf

posted by olethros

Read More »

click to vote

TSMC
2008

229views more TSMC 2008»

A Comprehensive Survey of Multiagent Reinforcement Learning

13 years 10 months ago

Download www.dcsc.tudelft.nl

Multiagent systems are rapidly finding applications in a variety of domains, including robotics, distributed control, telecommunications, and economics. The complexity of many task...

Lucian Busoniu, Robert Babuska, Bart De Schutter

claim paper

Read More »

« Prev « First page 21 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers