Search Sciweavers | Sciweavers

: In order to scale to problems with large or continuous state-spaces, reinforcement learning algorithms need to be combined with function approximation techniques. The majority of...

Peter Vamplew, Robert Ollington

claim paper

Read More »

183

click to vote

NIPS
1998

155views Information Technology» more NIPS 1998»

Batch and On-Line Parameter Estimation of Gaussian Mixtures Based on the Joint Entropy

15 years 8 months ago

Download users.soe.ucsc.edu

We describe a new iterative method for parameter estimation of Gaussian mixtures. The new method is based on a framework developed by Kivinen and Warmuth for supervised on-line le...

Yoram Singer, Manfred K. Warmuth

claim paper

Read More »

150

click to vote

TIT
2008

76views more TIT 2008»

Improved Risk Tail Bounds for On-Line Algorithms

15 years 6 months ago

Download books.nips.cc

We prove the strongest known bound for the risk of hypotheses selected from the ensemble generated by running a learning algorithm incrementally on the training data. Our result i...

Nicolò Cesa-Bianchi, Claudio Gentile

claim paper

Read More »

196

click to vote

COLT
2003
Springer

141views Machine Learning» more COLT 2003»

On-Line Learning with Imperfect Monitoring

16 years 3 days ago

Download www.ece.mcgill.ca

We study on-line play of repeated matrix games in which the observations of past actions of the other player and the obtained reward are partial and stochastic. We deﬁne the Part...

Shie Mannor, Nahum Shimkin

claim paper

Read More »

« Prev « First page 1 / 66 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers