Search Sciweavers | Sciweavers

38 search results - page 4 / 8

» The utility of temporal abstraction in reinforcement learnin...

CG
2000
Springer

150 views, Computer Graphics

Chess Neighborhoods, Function Combination, and Reinforcement Learning

14 years 8 hours ago

Download users.soe.ucsc.edu

Abstract. Over the years, various research projects have attempted to develop a chess program that learns to play well given little prior knowledge beyond the rules of the game. Ea...

Robert Levinson, Ryan Weber

ECAI
2010
Springer

238 views, Artificial Intelligence

The Dynamics of Multi-Agent Reinforcement Learning

13 years 8 months ago

Download www.doc.ic.ac.uk

Abstract. Infinite-horizon multi-agent control processes with nondeterminism and partial state knowledge have particularly interesting properties with respect to adaptive control, ...

Luke Dickens, Krysia Broda, Alessandra Russo

ILP
2007
Springer

283 views, Automated Reasoning

Building Relational World Models for Reinforcement Learning

14 years 1 month ago

Download ftp.cs.wisc.edu

Abstract. Many reinforcement learning domains are highly relational. While traditional temporal-difference methods can be applied to these domains, they are limited in their capaci...

Trevor Walker, Lisa Torrey, Jude W. Shavlik, Richa...

NIPS
2003

148 views, Information Technology

Approximate Planning in POMDPs with Macro-Actions

13 years 9 months ago

Download books.nips.cc

Recent research has demonstrated that useful POMDP solutions do not require consideration of the entire belief space. We extend this idea with the notion of temporal abstraction. ...

Georgios Theocharous, Leslie Pack Kaelbling

ATAL
2004
Springer

97 views, Intelligent Agents

Unifying Temporal and Structural Credit Assignment Problems

14 years 1 month ago

Download ti.arc.nasa.gov

Single-agent reinforcement learners in time-extended domains and multi-agent systems share a common dilemma known as the credit assignment problem. Multi-agent systems have the st...

Adrian K. Agogino, Kagan Tumer
