Search Sciweavers | Sciweavers

45

ICML
2006
IEEE

144views Machine Learning» more ICML 2006»

Quadratic programming relaxations for metric labeling and Markov random field MAP estimation

14 years 10 months ago

Quadratic program relaxations are proposed as an alternative to linear program relaxations and tree reweighted belief propagation for the metric labeling or MAP estimation problem...

Pradeep D. Ravikumar, John D. Lafferty

claim paper

Read More »

35

click to vote

ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

PAC model-free reinforcement learning

14 years 10 months ago

Download cseweb.ucsd.edu

For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm--Delayed Q-Learning. We prove it is PAC, achieving near o...

Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...

claim paper

Read More »

35

click to vote

ICML
2006
IEEE

108views Machine Learning» more ICML 2006»

Experience-efficient learning in associative bandit problems

14 years 10 months ago

Download paul.rutgers.edu

We formalize the associative bandit problem framework introduced by Kaelbling as a learning-theory problem. The learning environment is modeled as a k-armed bandit where arm payof...

Alexander L. Strehl, Chris Mesterharm, Michael L. ...

claim paper

Read More »

30

click to vote

ICML
2006
IEEE

134views Machine Learning» more ICML 2006»

Statistical debugging: simultaneous identification of multiple bugs

14 years 10 months ago

Download pages.cs.wisc.edu

We describe a statistical approach to software debugging in the presence of multiple bugs. Due to sparse sampling issues and complex interaction between program predicates, many g...

Alice X. Zheng, Michael I. Jordan, Ben Liblit, May...

claim paper

Read More »

28

click to vote

ICML
2006
IEEE

186views Machine Learning» more ICML 2006»

Discriminative unsupervised learning of structured predictors

14 years 10 months ago

Download www.cs.ualberta.ca

We present a new unsupervised algorithm for training structured predictors that is discriminative, convex, and avoids the use of EM. The idea is to formulate an unsupervised versi...

Linli Xu, Dana F. Wilkinson, Finnegan Southey, Dal...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers