Sciweavers

326 search results - page 1 / 66
» Reinforcement Learning Based on On-Line EM Algorithm
AUSAI
2005
Springer
14 years 29 days ago
Global Versus Local Constructive Function Approximation for On-Line Reinforcement Learning
In order to scale to problems with large or continuous state-spaces, reinforcement learning algorithms need to be combined with function approximation techniques. The majority of...
Peter Vamplew, Robert Ollington
NIPS
1998
13 years 8 months ago
Batch and On-Line Parameter Estimation of Gaussian Mixtures Based on the Joint Entropy
We describe a new iterative method for parameter estimation of Gaussian mixtures. The new method is based on a framework developed by Kivinen and Warmuth for supervised on-line le...
Yoram Singer, Manfred K. Warmuth
TIT
2008
13 years 7 months ago
Improved Risk Tail Bounds for On-Line Algorithms
We prove the strongest known bound for the risk of hypotheses selected from the ensemble generated by running a learning algorithm incrementally on the training data. Our result i...
Nicolò Cesa-Bianchi, Claudio Gentile
COLT
2003
Springer
14 years 19 days ago
On-Line Learning with Imperfect Monitoring
We study on-line play of repeated matrix games in which the observations of past actions of the other player and the obtained reward are partial and stochastic. We define the Part...
Shie Mannor, Nahum Shimkin