Search Sciweavers | Sciweavers

3412 search results - page 55 / 683

» Efficient Reinforcement Learning

225

click to vote

AGENTS
2001
Springer

201views Security Privacy» more AGENTS 2001»

Using background knowledge to speed reinforcement learning in physical agents

15 years 11 months ago

Download www.isle.org

This paper describes Icarus, an agent architecture that embeds a hierarchical reinforcement learning algorithm within a language for specifying agent behavior. An Icarus program e...

Daniel G. Shapiro, Pat Langley, Ross D. Shachter

claim paper

Read More »

168

click to vote

AAAI
1997

107views Intelligent Agents» more AAAI 1997»

Reinforcement Learning with Time

15 years 8 months ago

Download www.aaai.org

This paper steps back from the standard infinite horizon formulation of reinforcement learning problems to consider the simpler case of finite horizon problems. Although finite ho...

Daishi Harada

claim paper

Read More »

201

click to vote

ICML
2005
IEEE

121views Machine Learning» more ICML 2005»

Combining model-based and instance-based learning for first order regression

16 years 7 months ago

Download www.cs.kuleuven.ac.be

T ORDER REGRESSION (EXTENDED ABSTRACT) Kurt Driessensa Saso Dzeroskib a Department of Computer Science, University of Waikato, Hamilton, New Zealand (kurtd@waikato.ac.nz) b Departm...

Kurt Driessens, Saso Dzeroski

claim paper

Read More »

199

click to vote

ATAL
2007
Springer

111views Intelligent Agents» more ATAL 2007»

IFSA: incremental feature-set augmentation for reinforcement learning tasks

16 years 1 months ago

Download userweb.cs.utexas.edu

Reinforcement learning is a popular and successful framework for many agent-related problems because only limited environmental feedback is necessary for learning. While many algo...

Mazda Ahmadi, Matthew E. Taylor, Peter Stone

claim paper

Read More »

190

click to vote

CG
2006
Springer

155views Computer Graphics» more CG 2006»

Feature Construction for Reinforcement Learning in Hearts

15 years 9 months ago

Download webdocs.cs.ualberta.ca

Temporal difference (TD) learning has been used to learn strong evaluation functions in a variety of two-player games. TD-gammon illustrated how the combination of game tree search...

Nathan R. Sturtevant, Adam M. White

claim paper

Read More »

« Prev « First page 55 / 683 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers