Search Sciweavers | Sciweavers

1236 search results - page 120 / 248

» Opposition-Based Reinforcement Learning


ATAL
2007
Springer

146 views, Intelligent Agents

Transfer via inter-task mappings in policy search reinforcement learning

16 years 26 days ago

Download userweb.cs.utexas.edu

The ambitious goal of transfer learning is to accelerate learning on a target task after training on a different, but related, source task. While many past transfer methods have f...

Matthew E. Taylor, Shimon Whiteson, Peter Stone



ECML
2007
Springer

170 views, Machine Learning

Sequence Labeling with Reinforcement Learning and Ranking Algorithms

15 years 8 months ago

Download nieme.lip6.fr

Many problems in areas such as Natural Language Processing, Information Retrieval, or Bioinformatics involve the generic task of sequence labeling. In many cases, the aim is to assi...

Francis Maes, Ludovic Denoyer, Patrick Gallinari



AAAI
2004

135 views, Intelligent Agents

Performance Bounded Reinforcement Learning in Strategic Interactions

15 years 8 months ago

Download www.aaai.org

Despite increasing deployment of agent technologies in several business and industry domains, user confidence in fully automated agent-driven applications is noticeably lacking. T...

Bikramjit Banerjee, Jing Peng



ICML
2008
IEEE

133 views, Machine Learning

An analysis of linear models, linear value-function approximation, and feature selection for reinforcement learning

16 years 7 months ago

Download www.cs.duke.edu

We show that linear value-function approximation is equivalent to a form of linear model approximation. We then derive a relationship between the model-approximation error and the...

Ronald Parr, Lihong Li, Gavin Taylor, Christopher ...



ICML
1996
IEEE

162 views, Machine Learning

Sensitive Discount Optimality: Unifying Discounted and Average Reward Reinforcement Learning

16 years 7 months ago

Download reference.kfupm.edu.sa

Research in reinforcement learning (RL) has thus far concentrated on two optimality criteria: the discounted framework, which has been very well-studied, and the average reward frame...

Sridhar Mahadevan
