Search Sciweavers | Sciweavers

536 search results - page 37 / 108

» Residual Algorithms: Reinforcement Learning with Function Ap...

127

Voted

NECO
2010

97views more NECO 2010»

Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning

15 years 1 months ago

Download www.kyb.tuebingen.mpg.de

Most conventional Policy Gradient Reinforcement Learning (PGRL) algorithms neglect (or do not explicitly make use of) a term in the average reward gradient with respect to the pol...

Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto...

claim paper

Read More »

113

Voted

JMLR
2010

125views more JMLR 2010»

Variational methods for Reinforcement Learning

14 years 9 months ago

Download jmlr.csail.mit.edu

We consider reinforcement learning as solving a Markov decision process with unknown transition distribution. Based on interaction with the environment, an estimate of the transit...

Thomas Furmston, David Barber

claim paper

Read More »

119

Voted

ATAL
2003
Springer

154views Intelligent Agents» more ATAL 2003»

Coordination in multiagent reinforcement learning: a Bayesian approach

15 years 8 months ago

Download www.cs.toronto.edu

Much emphasis in multiagent reinforcement learning (MARL) research is placed on ensuring that MARL algorithms (eventually) converge to desirable equilibria. As in standard reinfor...

Georgios Chalkiadakis, Craig Boutilier

claim paper

Read More »

114

Voted

ICML
2003
IEEE

105views Machine Learning» more ICML 2003»

Principled Methods for Advising Reinforcement Learning Agents

16 years 3 months ago

Download www.hpl.hp.com

An important issue in reinforcement learning is how to incorporate expert knowledge in a principled manner, especially as we scale up to real-world tasks. In this paper, we presen...

Eric Wiewiora, Garrison W. Cottrell, Charles Elkan

claim paper

Read More »

109

Voted

ICML
2008
IEEE

166views Machine Learning» more ICML 2008»

ManifoldBoost: stagewise function approximation for fully-, semi- and un-supervised learning

16 years 3 months ago

Download reason.cs.uiuc.edu

We introduce a boosting framework to solve a classification problem with added manifold and ambient regularization costs. It allows for a natural extension of boosting into both s...

Nicolas Loeff, David A. Forsyth, Deepak Ramachandr...

claim paper

Read More »

« Prev « First page 37 / 108 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers