Search Sciweavers | Sciweavers

2108 search results - page 106 / 422

» Tracking in Reinforcement Learning

NN
2006
Springer

Neural Networks

The asymptotic equipartition property in reinforcement learning and its relation to return maximization

We discuss an important property called the asymptotic equipartition property on empirical sequences in reinforcement learning. This states that the typical set of empirical seque...

Kazunori Iwata, Kazushi Ikeda, Hideaki Sakai

PKDD
2010
Springer

Data Mining

Smarter Sampling in Model-Based Bayesian Reinforcement Learning

Bayesian reinforcement learning (RL) is aimed at making more efficient use of data samples, but typically uses significantly more computation. For discrete Markov Decis...

Pablo Samuel Castro, Doina Precup

IS
2010

Artificial Intelligence

Multicriteria reinforcement learning based on a Russian doll method for network routing

The routing in communication networks is typically a multicriteria decision making (MCDM) problem. However, setting the parameters of most used MCDM methods to fit the preferences ...

Alain Pétrowski, Farouk Aissanou, Ilham Ben...

CVPR
2011
IEEE

Computer Vision

Shape Grammar Parsing via Reinforcement Learning

This paper tackles shape grammar parsing for facade segmentation using a novel optimization approach based on reinforcement learning (RL). To this end, we use a binary recursive g...

Olivier Teboul, Iasonas Kokkinos, Panagiotis Kouts...

AAAI
2012

Intelligent Agents

Kernel-Based Reinforcement Learning on Representative States

Markov decision processes (MDPs) are an established framework for solving sequential decision-making problems under uncertainty. In this work, we propose a new method for batchmod...

Branislav Kveton, Georgios Theocharous
