Search Sciweavers | Sciweavers

995 search results - page 51 / 199

» Learning Useful Horn Approximations

140

click to vote

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

15 years 5 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

146

click to vote

KDD
2004
ACM

158views Data Mining» more KDD 2004»

A generalized maximum entropy approach to bregman co-clustering and matrix approximation

16 years 4 months ago

Download www.ideal.ece.utexas.edu

Co-clustering is a powerful data mining technique with varied applications such as text clustering, microarray analysis and recommender systems. Recently, an informationtheoretic ...

Arindam Banerjee, Inderjit S. Dhillon, Joydeep Gho...

claim paper

Read More »

139

click to vote

CORR
2010
Springer

119views Education» more CORR 2010»

Dynamic Policy Programming

15 years 4 months ago

Download www.snn.ru.nl

In this paper, we consider the problem of planning and learning in the infinite-horizon discounted-reward Markov decision problems. We propose a novel iterative direct policysearc...

Mohammad Gheshlaghi Azar, Hilbert J. Kappen

claim paper

Read More »

181

click to vote

JMLR
2012

192views Programming Languages» more JMLR 2012»

Detecting Network Cliques with Radon Basis Pursuit

13 years 6 months ago

Download www.math.pku.edu.cn

In this paper, we propose a novel formulation of the network clique detection problem by introducing a general network data representation framework. We show connections between o...

Xiaoye Jiang, Yuan Yao, Han Liu, Leonidas J. Guiba...

claim paper

Read More »

126

click to vote

COLT
2006
Springer

100views Machine Learning» more COLT 2006»

Unifying Divergence Minimization and Statistical Inference Via Convex Duality

15 years 7 months ago

Download ttic.uchicago.edu

Abstract. In this paper we unify divergence minimization and statistical inference by means of convex duality. In the process of doing so, we prove that the dual of approximate max...

Yasemin Altun, Alexander J. Smola

claim paper

Read More »

« Prev « First page 51 / 199 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers