Search Sciweavers | Sciweavers

945 search results - page 179 / 189

» Dialog Convergence and Learning

201

click to vote

CDC
2010
IEEE

136views Control Systems» more CDC 2010»

Pathologies of temporal difference methods in approximate dynamic programming

15 years 2 months ago

Download web.mit.edu

Approximate policy iteration methods based on temporal differences are popular in practice, and have been tested extensively, dating to the early nineties, but the associated conve...

Dimitri P. Bertsekas

claim paper

Read More »

162

click to vote

JAIR
2011

187views more JAIR 2011»

A Monte-Carlo AIXI Approximation

15 years 1 months ago

Download www.hutter1.net

This paper describes a computationally feasible approximation to the AIXI agent, a universal reinforcement learning agent for arbitrary environments. AIXI is scaled down in two ke...

Joel Veness, Kee Siong Ng, Marcus Hutter, William ...

claim paper

Read More »

172

click to vote

JMLR
2010

191views more JMLR 2010»

Noise-contrastive estimation: A new estimation principle for unnormalized statistical models

15 years 1 months ago

Download jmlr.csail.mit.edu

We present a new estimation principle for parameterized statistical models. The idea is to perform nonlinear logistic regression to discriminate between the observed data and some...

Michael Gutmann, Aapo Hyvärinen

claim paper

Read More »

226

click to vote

TKDE
2010

224views more TKDE 2010»

Non-Negative Matrix Factorization for Semisupervised Heterogeneous Data Coclustering

15 years 1 months ago

Download www.cs.wayne.edu

Coclustering heterogeneous data has attracted extensive attention recently due to its high impact on various important applications, such us text mining, image retrieval, and bioin...

Yanhua Chen, Lijun Wang, Ming Dong

claim paper

Read More »

219

click to vote

CORR
2011
Springer

177views Education» more CORR 2011»

Gossip PCA

14 years 10 months ago

Download www.stanford.edu

Eigenvectors of data matrices play an important role in many computational problems, ranging from signal processing to machine learning and control. For instance, algorithms that ...

Satish Babu Korada, Andrea Montanari, Sewoong Oh

claim paper

Read More »

« Prev « First page 179 / 189 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers