Sciweavers

ICML
2010
IEEE
14 years 18 days ago
Inverse Optimal Control with Linearly-Solvable MDPs
We present new algorithms for inverse optimal control (or inverse reinforcement learning, IRL) within the framework of linearlysolvable MDPs (LMDPs). Unlike most prior IRL algorit...
Dvijotham Krishnamurthy, Emanuel Todorov
ICML
2010
IEEE
14 years 18 days ago
Bayesian Nonparametric Matrix Factorization for Recorded Music
Recent research in machine learning has focused on breaking audio spectrograms into separate sources of sound using latent variable decompositions. These methods require that the ...
Matthew D. Hoffman, David M. Blei, Perry R. Cook
ICML
2010
IEEE
14 years 18 days ago
Exploiting Data-Independence for Fast Belief-Propagation
Maximum a posteriori (MAP) inference in graphical models requires that we maximize the sum of two terms: a data-dependent term, encoding the conditional likelihood of a certain la...
Julian John McAuley, Tibério S. Caetano
ICML
2010
IEEE
14 years 18 days ago
Simple and Efficient Multiple Kernel Learning by Group Lasso
We consider the problem of how to improve the efficiency of Multiple Kernel Learning (MKL). In literature, MKL is often solved by an alternating approach: (1) the minimization of ...
Zenglin Xu, Rong Jin, Haiqin Yang, Irwin King, Mic...
ICML
2010
IEEE
14 years 18 days ago
A fast natural Newton method
Nicolas Le Roux, Andrew Fitzgibbon
ICML
2010
IEEE
14 years 18 days ago
Continuous-Time Belief Propagation
Many temporal processes can be naturally modeled as a stochastic system that evolves continuously over time. The representation language of continuous-time Bayesian networks allow...
Tal El-Hay, Ido Cohn, Nir Friedman, Raz Kupferman
ICML
2010
IEEE
14 years 18 days ago
Forgetting Counts: Constant Memory Inference for a Dependent Hierarchical Pitman-Yor Process
We propose a novel dependent hierarchical Pitman-Yor process model for discrete data. An incremental Monte Carlo inference procedure for this model is developed. We show that infe...
Nicholas Bartlett, David Pfau, Frank Wood
ICML
2010
IEEE
14 years 18 days ago
Convergence of Least Squares Temporal Difference Methods Under General Conditions
We consider approximate policy evaluation for finite state and action Markov decision processes (MDP) in the off-policy learning context and with the simulation-based least square...
Huizhen Yu
ICML
2010
IEEE
14 years 18 days ago
Budgeted Distribution Learning of Belief Net Parameters
Liuyang Li, Barnabás Póczos, Csaba S...