Search Sciweavers | Sciweavers

428 search results - page 7 / 86

» An Experts Algorithm for Transfer Learning

178

click to vote

ICML
2004
IEEE

214views Machine Learning» more ICML 2004»

Apprenticeship learning via inverse reinforcement learning

16 years 7 months ago

Download ai.stanford.edu

We consider learning in a Markov decision process where we are not explicitly given a reward function, but where instead we can observe an expert demonstrating the task that we wa...

Pieter Abbeel, Andrew Y. Ng

claim paper

Read More »

131

click to vote

CORR
2007
Springer

95views Education» more CORR 2007»

Prediction with expert advice for the Brier game

15 years 6 months ago

Download www.machinelearning.org

We show that the Brier game of prediction is mixable and ﬁnd the optimal learning rate and substitution function for it. The resulting prediction algorithm is applied to predict...

Vladimir Vovk

claim paper

Read More »

158

click to vote

COLT
2008
Springer

140views Machine Learning» more COLT 2008»

Regret Bounds for Sleeping Experts and Bandits

15 years 8 months ago

Download colt2008.cs.helsinki.fi

We study on-line decision problems where the set of actions that are available to the decision algorithm vary over time. With a few notable exceptions, such problems remained larg...

Robert D. Kleinberg, Alexandru Niculescu-Mizil, Yo...

claim paper

Read More »

160

click to vote

AAAI
2008

141views Intelligent Agents» more AAAI 2008»

Online Learning with Expert Advice and Finite-Horizon Constraints

15 years 9 months ago

Download www.aaai.org

In this paper, we study a sequential decision making problem. The objective is to maximize the average reward accumulated over time subject to temporal cost constraints. The novel...

Branislav Kveton, Jia Yuan Yu, Georgios Theocharou...

claim paper

Read More »

241

click to vote

EUROCOLT
1999
Springer

166views Machine Learning» more EUROCOLT 1999»

Averaging Expert Predictions

15 years 11 months ago

Download users.soe.ucsc.edu

We consider algorithms for combining advice from a set of experts. In each trial, the algorithm receives the predictions of the experts and produces its own prediction. A loss func...

Jyrki Kivinen, Manfred K. Warmuth

claim paper

Read More »

« Prev « First page 7 / 86 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers