Sciweavers

2108 search results - page 106 / 422
» Tracking in Reinforcement Learning
Sort
View
NN
2006
Springer
127views Neural Networks» more  NN 2006»
13 years 10 months ago
The asymptotic equipartition property in reinforcement learning and its relation to return maximization
We discuss an important property called the asymptotic equipartition property on empirical sequences in reinforcement learning. This states that the typical set of empirical seque...
Kazunori Iwata, Kazushi Ikeda, Hideaki Sakai
PKDD
2010
Springer
129views Data Mining» more  PKDD 2010»
13 years 8 months ago
Smarter Sampling in Model-Based Bayesian Reinforcement Learning
Abstract. Bayesian reinforcement learning (RL) is aimed at making more efficient use of data samples, but typically uses significantly more computation. For discrete Markov Decis...
Pablo Samuel Castro, Doina Precup
IS
2010
13 years 7 months ago
Multicriteria reinforcement learning based on a Russian doll method for network routing
The routing in communication networks is typically a multicriteria decision making (MCDM) problem. However, setting the parameters of most used MCDM methods to fit the preferences ...
Alain Pétrowski, Farouk Aissanou, Ilham Ben...
CVPR
2011
IEEE
13 years 6 months ago
Shape Grammar Parsing via Reinforcement Learning
This paper tackles shape grammar parsing for facade segmentation using a novel optimization approach based on reinforcement learning (RL). To this end, we use a binary recursive g...
Olivier Teboul, Iasonas Kokkinos, Panagiotis Kouts...
AAAI
2012
12 years 11 days ago
Kernel-Based Reinforcement Learning on Representative States
Markov decision processes (MDPs) are an established framework for solving sequential decision-making problems under uncertainty. In this work, we propose a new method for batchmod...
Branislav Kveton, Georgios Theocharous