ICML 2009 | Sciweavers

35

ICML
2009
IEEE

148views Machine Learning» more ICML 2009»

Predictive representations for policy gradient in POMDPs

15 years 10 days ago

We consider the problem of estimating the policy gradient in Partially Observable Markov Decision Processes (POMDPs) with a special class of policies that are based on Predictive ...

Abdeslam Boularias, Brahim Chaib-draa

claim paper

Read More »

37

click to vote

ICML
2009
IEEE

246views Machine Learning» more ICML 2009»

BoltzRank: learning to maximize expected ranking gain

15 years 10 days ago

Download www.cs.toronto.edu

Ranking a set of retrieved documents according to their relevance to a query is a popular problem in information retrieval. Methods that learn ranking functions are difficult to o...

Maksims Volkovs, Richard S. Zemel

claim paper

Read More »

45

click to vote

ICML
2009
IEEE

159views Machine Learning» more ICML 2009»

SimpleNPKL: simple non-parametric kernel learning

15 years 10 days ago

Download www.cais.ntu.edu.sg

Previous studies of Non-Parametric Kernel (NPK) learning usually reduce to solving some Semi-Definite Programming (SDP) problem by a standard SDP solver. However, time complexity ...

Jinfeng Zhuang, Ivor W. Tsang, Steven C. H. Hoi

claim paper

Read More »

32

click to vote

ICML
2009
IEEE

131views Machine Learning» more ICML 2009»

Monte-Carlo simulation balancing

15 years 10 days ago

Download www.cs.ualberta.ca

In this paper we introduce the first algorithms for efficiently learning a simulation policy for Monte-Carlo search. Our main idea is to optimise the balance of a simulation polic...

David Silver, Gerald Tesauro

claim paper

Read More »

29

click to vote

ICML
2009
IEEE

123views Machine Learning» more ICML 2009»

Constraint relaxation in approximate linear programs

15 years 10 days ago

Download anytime.cs.umass.edu

Approximate Linear Programming (ALP) is a reinforcement learning technique with nice theoretical properties, but it often performs poorly in practice. We identify some reasons for...

Marek Petrik, Shlomo Zilberstein

claim paper

Read More »

48

click to vote

ICML
2009
IEEE

272views Machine Learning» more ICML 2009»

Multi-class image segmentation using conditional random fields and global classification

15 years 10 days ago

Download user.cs.tu-berlin.de

A key aspect of semantic image segmentation is to integrate local and global features for the prediction of local segment labels. We present an approach to multi-class segmentatio...

Nils Plath, Marc Toussaint, Shinichi Nakajima

claim paper

Read More »

32

click to vote

ICML
2009
IEEE

141views Machine Learning» more ICML 2009»

A stochastic memoizer for sequence data

15 years 10 days ago

Download www.gatsby.ucl.ac.uk

We propose an unbounded-depth, hierarchical, Bayesian nonparametric model for discrete sequence data. This model can be estimated from a single training sequence, yet shares stati...

Frank Wood, Cédric Archambeau, Jan Gasthaus...

claim paper

Read More »

37

click to vote

ICML
2009
IEEE

222views Machine Learning» more ICML 2009»

Unsupervised hierarchical modeling of locomotion styles

15 years 10 days ago

Download www.cs.dartmouth.edu

This paper describes an unsupervised learning technique for modeling human locomotion styles, such as distinct related activities (e.g. running and striding) or variations of the ...

Wei Pan, Lorenzo Torresani

claim paper

Read More »

38

click to vote

ICML
2009
IEEE

161views Machine Learning» more ICML 2009»

Matrix updates for perceptron training of continuous density hidden Markov models

15 years 10 days ago

Download www-rcf.usc.edu

In this paper, we investigate a simple, mistakedriven learning algorithm for discriminative training of continuous density hidden Markov models (CD-HMMs). Most CD-HMMs for automat...

Chih-Chieh Cheng, Fei Sha, Lawrence K. Saul

claim paper

Read More »

39

click to vote

ICML
2009
IEEE

157views Machine Learning» more ICML 2009»

Learning structurally consistent undirected probabilistic graphical models

15 years 10 days ago

Download www.cs.unm.edu

In many real-world domains, undirected graphical models such as Markov random fields provide a more natural representation of the dependency structure than directed graphical mode...

Sushmita Roy, Terran Lane, Margaret Werner-Washbur...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers