Search Sciweavers | Sciweavers

125 search results - page 14 / 25

» The Stochastic Machine Replenishment Problem

180

click to vote

WSC
2008

214views Modeling And Simulation» more WSC 2008»

Approximate dynamic programming: Lessons from the field

15 years 9 months ago

Download www.informs-sim.org

Approximate dynamic programming is emerging as a powerful tool for certain classes of multistage stochastic, dynamic problems that arise in operations research. It has been applie...

Warren B. Powell

claim paper

Read More »

195

click to vote

ALT
2010
Springer

342views Machine Learning» more ALT 2010»

Online Multiple Kernel Learning: Algorithms and Mistake Bounds

15 years 8 months ago

Download www.cse.msu.edu

Online learning and kernel learning are two active research topics in machine learning. Although each of them has been studied extensively, there is a limited effort in addressing ...

Rong Jin, Steven C. H. Hoi, Tianbao Yang

claim paper

Read More »

295

click to vote

Publication

233views

Sparse reward processes

14 years 5 months ago

Download arxiv.org

We introduce a class of learning problems where the agent is presented with a series of tasks. Intuitively, if there is relation among those tasks, then the information gained duri...

Christos Dimitrakakis

posted by olethros

Read More »

187

click to vote

ICML
2007
IEEE

275views Machine Learning» more ICML 2007»

Pegasos: Primal Estimated sub-GrAdient SOlver for SVM

16 years 7 months ago

Download ttic.uchicago.edu

We describe and analyze a simple and effective iterative algorithm for solving the optimization problem cast by Support Vector Machines (SVM). Our method alternates between stocha...

Shai Shalev-Shwartz, Yoram Singer, Nathan Srebro

claim paper

Read More »

182

click to vote

ICML
2005
IEEE

100views Machine Learning» more ICML 2005»

Reinforcement learning with Gaussian processes

16 years 7 months ago

Download www.machinelearning.org

Gaussian Process Temporal Difference (GPTD) learning offers a Bayesian solution to the policy evaluation problem of reinforcement learning. In this paper we extend the GPTD framew...

Yaakov Engel, Shie Mannor, Ron Meir

claim paper

Read More »

« Prev « First page 14 / 25 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers