Sciweavers

168 search results - page 1 / 34
» Optimism in Reinforcement Learning Based on Kullback-Leibler...
Sort
View
CORR
2010
Springer
105views Education» more  CORR 2010»
13 years 9 months ago
Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence
We consider model-based reinforcement learning in finite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...
Sarah Filippi, Olivier Cappé, Aurelien Gari...
PKDD
2010
Springer
164views Data Mining» more  PKDD 2010»
13 years 8 months ago
Efficient Planning in Large POMDPs through Policy Graph Based Factorized Approximations
Partially observable Markov decision processes (POMDPs) are widely used for planning under uncertainty. In many applications, the huge size of the POMDP state space makes straightf...
Joni Pajarinen, Jaakko Peltonen, Ari Hottinen, Mik...
CVPR
2005
IEEE
14 years 4 months ago
Jensen-Shannon Boosting Learning for Object Recognition
In this paper, we propose a novel learning method, called Jensen-Shannon Boosting (JSBoost) and demonstrate its application to object recognition. JSBoost incorporates Jensen-Shan...
Xiangsheng Huang, Stan Z. Li, Yangsheng Wang
BMCBI
2008
228views more  BMCBI 2008»
13 years 11 months ago
Adaptive diffusion kernel learning from biological networks for protein function prediction
Background: Machine-learning tools have gained considerable attention during the last few years for analyzing biological networks for protein function prediction. Kernel methods a...
Liang Sun, Shuiwang Ji, Jieping Ye
ICONIP
2004
14 years 7 days ago
An Auxiliary Variational Method
Variational methods have proved popular and effective for inference and learning in intractable graphical models. An attractive feature of the approaches based on the Kullback-Lei...
Felix V. Agakov, David Barber