

ECML
2005
Springer


16 years 7 days ago

Abstract. Motivated by analogies to statistical physics, the deterministic annealing (DA) method has been successfully demonstrated in a variety of applications. In this paper, ...

Gang Wang, Zhihua Zhang, Frederick H. Lochovsky


ECML
2005
Springer


Natural Actor-Critic

16 years 7 days ago

Download www-clmc.usc.edu

This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...

Jan Peters, Sethu Vijayakumar, Stefan Schaal


ECML
2005
Springer


Using Advice to Transfer Knowledge Acquired in One Reinforcement Learning Task to Another

16 years 7 days ago

Download pages.cs.wisc.edu

We present a method for transferring knowledge learned in one task to a related task. Our problem solvers employ reinforcement learning to acquire a model for one task. We then tra...

Lisa Torrey, Trevor Walker, Jude W. Shavlik, Richa...


ECML
2005
Springer


A Distance-Based Approach for Action Recommendation

16 years 5 days ago

Download www.lri.fr

Abstract. Rule induction has attracted a great deal of attention in Machine Learning and Data Mining. However, generating rules is not an end in itself because their applicability ...

Ronan Trepos, Ansaf Salleb, Marie-Odile Cordier, V...


ECML
2005
Springer


Nonrigid Embeddings for Dimensionality Reduction

16 years 7 days ago

Download www.merl.com

Spectral methods for embedding graphs and immersing data manifolds in low-dimensional spaces are notoriously unstable due to insufficient and/or numerically ill-conditioned con...

Matthew Brand
