Search Sciweavers | Sciweavers

1760 search results - page 107 / 352

» Learning from Partial Observations

114

click to vote

EACL
2006
ACL Anthology

173views Natural Language Processing» more EACL 2006»

Generalized Hebbian Algorithm for Incremental Singular Value Decomposition in Natural Language Processing

15 years 4 months ago

Download www.aclweb.org

An algorithm based on the Generalized Hebbian Algorithm is described that allows the singular value decomposition of a dataset to be learned based on single observation pairs pres...

Genevieve Gorrell

claim paper

Read More »

116

click to vote

NIPS
1996

133views Information Technology» more NIPS 1996»

Continuous Sigmoidal Belief Networks Trained using Slice Sampling

15 years 4 months ago

Download www.psi.toronto.edu

Real-valued random hidden variables can be useful for modelling latent structure that explains correlations among observed variables. I propose a simple unit that adds zero-mean G...

Brendan J. Frey

claim paper

Read More »

162

click to vote

Publication

154views

Preference elicitation and inverse reinforcement learning

14 years 5 months ago

Download arxiv.org

We state the problem of inverse reinforcement learning in terms of preference elicitation, resulting in a principled (Bayesian) statistical formulation. This generalises previous w...

Constantin Rothkopf, Christos Dimitrakakis

posted by olethros

Read More »

132

click to vote

CVPR
1997
IEEE

153views Computer Vision» more CVPR 1997»

Learning Generic Prior Models for Visual Computation

15 years 7 months ago

Download www.dam.brown.edu

This paper presents a novel theory for learning generic prior models from a set of observed natural images based on a minimax entropy theory that the authors studied in modeling t...

Song Chun Zhu, David Mumford

claim paper

Read More »

124

click to vote

ICANN
2009
Springer

123views Neural Networks» more ICANN 2009»

Efficient Uncertainty Propagation for Reinforcement Learning with Limited Data

15 years 6 months ago

Download www.tu-ilmenau.de

In a typical reinforcement learning (RL) setting details of the environment are not given explicitly but have to be estimated from observations. Most RL approaches only optimize th...

Alexander Hans, Steffen Udluft

claim paper

Read More »

« Prev « First page 107 / 352 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers