Search Sciweavers | Sciweavers

827 search results - page 25 / 166

» Variational methods for Reinforcement Learning

172

click to vote

ICRA
2009
IEEE

143views Robotics» more ICRA 2009»

Least absolute policy iteration for robust value function approximation

16 years 1 months ago

Download sugiyama-www.cs.titech.ac.jp

Abstract— Least-squares policy iteration is a useful reinforcement learning method in robotics due to its computational efﬁciency. However, it tends to be sensitive to outliers...

Masashi Sugiyama, Hirotaka Hachiya, Hisashi Kashim...

claim paper

Read More »

168

click to vote

ECAI
2008
Springer

165views Artificial Intelligence» more ECAI 2008»

Belief revision with reinforcement learning for interactive object recognition

15 years 8 months ago

Download www.inf.fh-dortmund.de

From a conceptual point of view, belief revision and learning are quite similar. Both methods change the belief state of an intelligent agent by processing incoming information. Ho...

Thomas Leopold, Gabriele Kern-Isberner, Gabriele P...

claim paper

Read More »

193

click to vote

PKDD
2010
Springer

129views Data Mining» more PKDD 2010»

Smarter Sampling in Model-Based Bayesian Reinforcement Learning

15 years 5 months ago

Download www.cs.mcgill.ca

Abstract. Bayesian reinforcement learning (RL) is aimed at making more efﬁcient use of data samples, but typically uses signiﬁcantly more computation. For discrete Markov Decis...

Pablo Samuel Castro, Doina Precup

claim paper

Read More »

228

click to vote

CVPR
2011
IEEE

446views Computer Vision» more CVPR 2011»

Shape Grammar Parsing via Reinforcement Learning

15 years 3 months ago

Download www.mas.ecp.fr

This paper tackles shape grammar parsing for facade segmentation using a novel optimization approach based on reinforcement learning (RL). To this end, we use a binary recursive g...

Olivier Teboul, Iasonas Kokkinos, Panagiotis Kouts...

claim paper

Read More »

184

click to vote

ISDA
2009
IEEE

144views Operating System» more ISDA 2009»

Postponed Updates for Temporal-Difference Reinforcement Learning

16 years 1 months ago

Download www.science.uva.nl

This paper presents postponed updates, a new strategy for TD methods that can improve sample efﬁciency without incurring the computational and space requirements of model-based ...

Harm van Seijen, Shimon Whiteson

claim paper

Read More »

« Prev « First page 25 / 166 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers