Search Sciweavers | Sciweavers

» Stochastic complexity in learning

ECML
2007
Springer

Policy Gradient Critics

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

IJAR
2007

Complexity measurement of fundamental pseudo-independent models

Download www.cis.uoguelph.ca

Pseudo-independent (PI) models are a special class of probabilistic domain model (PDM) where a set of marginally independent domain variables shows collective dependency, a specia...

J. Lee, Y. Xiang

ICML
2009
IEEE

Online dictionary learning for sparse coding

Download www.di.ens.fr

Sparse coding--that is, modelling data vectors as sparse linear combinations of basis elements--is widely used in machine learning, neuroscience, signal processing, and statistics...

Julien Mairal, Francis Bach, Jean Ponce, Guillermo...

ICML
2008
IEEE

Efficiently learning linear-linear exponential family predictive representations of state

Download web.mit.edu

Exponential Family PSR (EFPSR) models capture stochastic dynamical systems by representing state as the parameters of an exponential family distribution over a short-term window of...

David Wingate, Satinder P. Singh

ICML
2005
IEEE

Reinforcement learning with Gaussian processes

Download www.machinelearning.org

Gaussian Process Temporal Difference (GPTD) learning offers a Bayesian solution to the policy evaluation problem of reinforcement learning. In this paper we extend the GPTD framew...

Yaakov Engel, Shie Mannor, Ron Meir
