Search Sciweavers | Sciweavers

779 search results - page 41 / 156

» Reinforcement Using Supervised Learning for Policy Generaliz...

click to vote

ICML
2005
IEEE

93views Machine Learning» more ICML 2005»

Relating reinforcement learning performance to classification performance

14 years 8 months ago

Download hunch.net

We prove a quantitative connection between the expected sum of rewards of a policy and binary classification performance on created subproblems. This connection holds without any ...

John Langford, Bianca Zadrozny

claim paper

Read More »

click to vote

ICML
2007
IEEE

172views Machine Learning» more ICML 2007»

Conditional random fields for multi-agent reinforcement learning

14 years 8 months ago

Download www.machinelearning.org

Conditional random fields (CRFs) are graphical models for modeling the probability of labels given the observations. They have traditionally been trained with using a set of obser...

Xinhua Zhang, Douglas Aberdeen, S. V. N. Vishwanat...

claim paper

Read More »

click to vote

ICAC
2006
IEEE

112views Applied Computing» more ICAC 2006»

A Hybrid Reinforcement Learning Approach to Autonomic Resource Allocation

14 years 1 months ago

Download userweb.cs.utexas.edu

— Reinforcement Learning (RL) provides a promising new approach to systems performance management that differs radically from standard queuing-theoretic approaches making use of ...

Gerald Tesauro, Nicholas K. Jong, Rajarshi Das, Mo...

claim paper

Read More »

click to vote

SBIA
2004
Springer

137views Artificial Intelligence» more SBIA 2004»

Heuristically Accelerated Q-Learning: A New Approach to Speed Up Reinforcement Learning

14 years 1 months ago

Download www.fei.edu.br

This work presents a new algorithm, called Heuristically Accelerated Q–Learning (HAQL), that allows the use of heuristics to speed up the well-known Reinforcement Learning algori...

Reinaldo A. C. Bianchi, Carlos H. C. Ribeiro, Anna...

claim paper

Read More »

click to vote

NIPS
2003

105views Information Technology» more NIPS 2003»

Gaussian Processes in Reinforcement Learning

13 years 9 months ago

Download books.nips.cc

We exploit some useful properties of Gaussian process (GP) regression models for reinforcement learning in continuous state spaces and discrete time. We demonstrate how the GP mod...

Carl Edward Rasmussen, Malte Kuss

claim paper

Read More »

« Prev « First page 41 / 156 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers