Search Sciweavers | Sciweavers

90 search results - page 5 / 18

» Bayesian Inverse Reinforcement Learning

Publication

151 views

Robust Bayesian reinforcement learning through tight lower bounds

14 years 1 month ago

Download arxiv.org

In the Bayesian approach to sequential decision making, exact calculation of the (subjective) utility is intractable. This extends to most special cases of interest, such as reinfo...

Christos Dimitrakakis

posted by olethros

ICML
2006
IEEE

136 views | Machine Learning

An analytic solution to discrete Bayesian reinforcement learning

16 years 3 months ago

Download www.cs.uwaterloo.ca

Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an online fashion as they interact with their environment. Existing RL algorithms co...

Pascal Poupart, Nikos A. Vlassis, Jesse Hoey, Kevi...

AAAI
2010

171 views | Intelligent Agents

Reinforcement Learning via AIXI Approximation

15 years 3 months ago

Download jveness.info

This paper introduces a principled approach for the design of a scalable general reinforcement learning agent. This approach is based on a direct approximation of AIXI, a Bayesian...

Joel Veness, Kee Siong Ng, Marcus Hutter, David Si...

ICML
2000
IEEE

165 views | Machine Learning

A Bayesian Framework for Reinforcement Learning

15 years 6 months ago

Download www.ece.uvic.ca

The reinforcement learning problem can be decomposed into two parallel types of inference: (i) estimating the parameters of a model for the underlying process; (ii) determining be...

Malcolm J. A. Strens

ICML
2010
IEEE

282 views | Machine Learning

Bayesian Multi-Task Reinforcement Learning

15 years 3 months ago

Download hal.inria.fr

We consider the problem of multi-task reinforcement learning where the learner is provided with a set of tasks, for which only a small number of samples can be generated for any g...

Alessandro Lazaric, Mohammad Ghavamzadeh
