Search Sciweavers | Sciweavers

» Approximation algorithms for budgeted learning problems

ATAL 2009, Springer (Intelligent Agents)

SarsaLandmark: an algorithm for learning in POMDPs with landmarks

Download www.aamas-conference.org

Reinforcement learning algorithms that use eligibility traces, such as Sarsa(λ), have been empirically shown to be effective in learning good estimated-state-based policies in pa...

Michael R. James, Satinder P. Singh

ICML 2000, IEEE (Machine Learning)

Combining Reinforcement Learning with a Local Control Algorithm

Download www-anw.cs.umass.edu

We explore combining reinforcement learning with a hand-crafted local controller in a manner suggested by the chaotic control algorithm of Vincent, Schmitt and Vincent (1994). A c...

Andrew G. Barto, Jette Randløv, Michael T. ...

JMLR 2008

Online Learning of Complex Prediction Problems Using Simultaneous Projections

Download jmlr.csail.mit.edu

We describe and analyze an algorithmic framework for online classification where each online trial consists of multiple prediction tasks that are tied together. We tackle the prob...

Yonatan Amit, Shai Shalev-Shwartz, Yoram Singer

ICML 2009, IEEE (Machine Learning)

Model-free reinforcement learning as mixture learning

Download user.cs.tu-berlin.de

We cast model-free reinforcement learning as the problem of maximizing the likelihood of a probabilistic mixture model via sampling, addressing both the infinite and finite horizo...

Nikos Vlassis, Marc Toussaint

ICML 2010, IEEE (Machine Learning)

Learning Efficiently with Approximate Inference via Dual Losses

Download www.cs.huji.ac.il

Many structured prediction tasks involve complex models where inference is computationally intractable, but where it can be well approximated using a linear programming relaxation...

Ofer Meshi, David Sontag, Tommi Jaakkola, Amir Glo...
