Search Sciweavers | Sciweavers

272 search results - page 31 / 55

» Parallel Reinforcement Learning with Linear Function Approxi...


KDD
2006
ACM

213 views, Data Mining

Learning sparse metrics via linear programming

16 years 7 months ago

Download people.csail.mit.edu

Calculation of object similarity, for example through a distance function, is a common part of data mining and machine learning algorithms. This calculation is crucial for efficie...

Glenn Fung, Rómer Rosales



ICML
2010
IEEE

247 views, Machine Learning

Inverse Optimal Control with Linearly-Solvable MDPs

15 years 7 months ago

Download www.cs.washington.edu

We present new algorithms for inverse optimal control (or inverse reinforcement learning, IRL) within the framework of linearly-solvable MDPs (LMDPs). Unlike most prior IRL algorit...

Dvijotham Krishnamurthy, Emanuel Todorov



COLT
1994
Springer

108 views, Machine Learning

Lower Bounds on the VC-Dimension of Smoothly Parametrized Function Classes

15 years 11 months ago

Download users.cecs.anu.edu.au

We examine the relationship between the VC-dimension and the number of parameters of a smoothly parametrized function class. We show that the VC-dimension of such a function class ...

Wee Sun Lee, Peter L. Bartlett, Robert C. Williams...



ICCV
2009
IEEE

325 views, Computer Vision

Bayesian Poisson regression for crowd counting

15 years 4 months ago

Download www.svcl.ucsd.edu

Poisson regression models the noisy output of a counting function as a Poisson random variable, with a log-mean parameter that is a linear function of the input vector. In this wo...

Antoni B. Chan, Nuno Vasconcelos



NIPS
2008

130 views, Information Technology

Temporal Difference Based Actor Critic Learning - Convergence and Neural Implementation

15 years 8 months ago

Download eprints.pascal-network.org

Actor-critic algorithms for reinforcement learning are achieving renewed popularity due to their good convergence properties in situations where other approaches often fail (e.g.,...

Dotan Di Castro, Dmitry Volkinshtein, Ron Meir

