Search Sciweavers | Sciweavers

374 search results - page 40 / 75

» Multiagent Reinforcement Learning: Theoretical Framework and...

ICML
2001
IEEE

185 views, Machine Learning

Off-Policy Temporal Difference Learning with Function Approximation

14 years 8 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

Publication

233 views

Sparse reward processes

12 years 6 months ago

Download arxiv.org

We introduce a class of learning problems where the agent is presented with a series of tasks. Intuitively, if there is a relation among those tasks, then the information gained duri...

Christos Dimitrakakis

posted by olethros

ICML
2003
IEEE

129 views, Machine Learning

Relativized Options: Choosing the Right Transformation

14 years 8 months ago

Download www-anw.cs.umass.edu

Relativized options combine model minimization methods and a hierarchical reinforcement learning framework to derive compact reduced representations of a related family of tasks. ...

Balaraman Ravindran, Andrew G. Barto

ICML
2003
IEEE

171 views, Machine Learning

Learning To Cooperate in a Social Dilemma: A Satisficing Approach to Bargaining

14 years 8 months ago

Download www.aaai.org

Learning in many multi-agent settings is inherently repeated play. This calls into question the naive application of single play Nash equilibria in multi-agent learning and sugges...

Jeff L. Stimpson, Michael A. Goodrich

CVPR
2005
IEEE

250 views, Computer Vision

A Semi-Supervised Active Learning Framework for Image Retrieval

14 years 9 months ago

Download www.cse.cuhk.edu.hk

Although recent studies have shown that unlabeled data are beneficial to boosting the image retrieval performance, very few approaches for image retrieval can learn with labeled a...

Steven C. H. Hoi, Michael R. Lyu
