Search Sciweavers | Sciweavers

1512 search results - page 112 / 303

» Qualitative reinforcement learning

163

click to vote

ICML
2000
IEEE

192views Machine Learning» more ICML 2000»

Convergence Problems of General-Sum Multiagent Reinforcement Learning

16 years 6 months ago

Download www.cs.ualberta.ca

Stochastic games are a generalization of MDPs to multiple agents, and can be used as a framework for investigating multiagent learning. Hu and Wellman (1998) recently proposed a m...

Michael H. Bowling

claim paper

Read More »

183

click to vote

ICML
2006
IEEE

256views Machine Learning» more ICML 2006»

Automatic basis function construction for approximate dynamic programming and reinforcement learning

15 years 11 months ago

Download www.ece.mcgill.ca

We address the problem of automatically constructing basis functions for linear approximation of the value function of a Markov Decision Process (MDP). Our work builds on results ...

Philipp W. Keller, Shie Mannor, Doina Precup

claim paper

Read More »

185

click to vote

IAT
2005
IEEE

180views Intelligent Agents» more IAT 2005»

Self-Organizing Cognitive Agents and Reinforcement Learning in Multi-Agent Environment

15 years 11 months ago

Download www3.ntu.edu.sg

This paper presents a self-organizing cognitive architecture, known as TD-FALCON, that learns to function through its interaction with the environment. TD-FALCON learns the value ...

Ah-Hwee Tan, Dan Xiao

claim paper

Read More »

148

click to vote

ATAL
2007
Springer

146views Intelligent Agents» more ATAL 2007»

Transfer via inter-task mappings in policy search reinforcement learning

15 years 11 months ago

Download userweb.cs.utexas.edu

The ambitious goal of transfer learning is to accelerate learning on a target task after training on a different, but related, source task. While many past transfer methods have f...

Matthew E. Taylor, Shimon Whiteson, Peter Stone

claim paper

Read More »

179

click to vote

ECML
2007
Springer

170views Machine Learning» more ECML 2007»

Sequence Labeling with Reinforcement Learning and Ranking Algorithms

15 years 7 months ago

Download nieme.lip6.fr

Many problems in areas such as Natural Language Processing, Information Retrieval, or Bioinformatic involve the generic task of sequence labeling. In many cases, the aim is to assi...

Francis Maes, Ludovic Denoyer, Patrick Gallinari

claim paper

Read More »

« Prev « First page 112 / 303 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers