Search Sciweavers | Sciweavers

ICML
2004
IEEE

Online learning of conditionally I.I.D. data

14 years 1 month ago

In this work we consider the task of relaxing the i.i.d. assumption in online pattern recognition (or classification), aiming to make existing learning algorithms applicable to a ...

Daniil Ryabko

SIAMCOMP
2008

The Forgetron: A Kernel-Based Perceptron on a Budget

13 years 7 months ago

Download ttic.uchicago.edu

Abstract. The Perceptron algorithm, despite its simplicity, often performs well in online classification tasks. The Perceptron becomes especially effective when it is used in conju...

Ofer Dekel, Shai Shalev-Shwartz, Yoram Singer

ICML
2006
IEEE

An analytic solution to discrete Bayesian reinforcement learning

14 years 8 months ago

Download www.cs.uwaterloo.ca

Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an online fashion as they interact with their environment. Existing RL algorithms co...

Pascal Poupart, Nikos A. Vlassis, Jesse Hoey, Kevi...

JMLR
2010

A Convergent Online Single Time Scale Actor Critic Algorithm

13 years 2 months ago

Download jmlr.csail.mit.edu

Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...

Dotan Di Castro, Ron Meir

COLT
2010
Springer

Optimal Algorithms for Online Convex Optimization with Multi-Point Bandit Feedback

13 years 5 months ago

Download www.eecs.berkeley.edu

Bandit convex optimization is a special case of online convex optimization with partial information. In this setting, a player attempts to minimize a sequence of adversarially gen...

Alekh Agarwal, Ofer Dekel, Lin Xiao
