Search Sciweavers | Sciweavers

» Opposition-Based Reinforcement Learning

NIPS
1997

113 views, Information Technology

Nonparametric Model-Based Reinforcement Learning

15 years 7 months ago

Download www.cs.cmu.edu

This paper describes some of the interactions of model learning algorithms and planning algorithms we have found in exploring model-based reinforcement learning. The paper focuses...

Christopher G. Atkeson

ICML
2009
IEEE

172 views, Machine Learning

Model-free reinforcement learning as mixture learning

16 years 7 months ago

Download user.cs.tu-berlin.de

We cast model-free reinforcement learning as the problem of maximizing the likelihood of a probabilistic mixture model via sampling, addressing both the infinite and finite horizo...

Nikos Vlassis, Marc Toussaint

ICMLA
2008

195 views, Machine Learning

Basis Function Construction in Reinforcement Learning Using Cascade-Correlation Learning Architecture

15 years 7 months ago

Download www.grappa.univ-lille3.fr

In reinforcement learning, it is a common practice to map the state(-action) space to a different one using basis functions. This transformation aims to represent the input data i...

Sertan Girgin, Philippe Preux

ATAL
2008
Springer

151 views, Intelligent Agents

Graph Laplacian based transfer learning in reinforcement learning

15 years 8 months ago

Download www.ifaamas.org

The aim of transfer learning is to accelerate learning in related domains. In reinforcement learning, many different features such as a value function and a policy can be transfer...

Yi-Ting Tsao, Ke-Ting Xiao, Von-Wun Soo

ICML
2004
IEEE

214 views, Machine Learning

Apprenticeship learning via inverse reinforcement learning

16 years 7 months ago

Download ai.stanford.edu

We consider learning in a Markov decision process where we are not explicitly given a reward function, but where instead we can observe an expert demonstrating the task that we wa...

Pieter Abbeel, Andrew Y. Ng
