Search Sciweavers | Sciweavers

360 search results - page 11 / 72

» Learning Evaluation Functions for Large Acyclic Domains

click to vote

JMLR
2010

113views more JMLR 2010»

Optimal Search on Clustered Structural Constraint for Learning Bayesian Network Structure

13 years 3 months ago

Download jmlr.csail.mit.edu

We study the problem of learning an optimal Bayesian network in a constrained search space; skeletons are compelled to be subgraphs of a given undirected graph called the super-st...

Kaname Kojima, Eric Perrier, Seiya Imoto, Satoru M...

claim paper

Read More »

click to vote

ICML
2007
IEEE

139views Machine Learning» more ICML 2007»

Learning state-action basis functions for hierarchical MDPs

14 years 9 months ago

Download www.machinelearning.org

This paper introduces a new approach to actionvalue function approximation by learning basis functions from a spectral decomposition of the state-action manifold. This paper exten...

Sarah Osentoski, Sridhar Mahadevan

claim paper

Read More »

click to vote

ICML
2006
IEEE

167views Machine Learning» more ICML 2006»

On a theory of learning with similarity functions

14 years 9 months ago

Download www.cc.gatech.edu

Kernel functions have become an extremely popular tool in machine learning, with an attractive theory as well. This theory views a kernel as implicitly mapping data points into a ...

Maria-Florina Balcan, Avrim Blum

claim paper

Read More »

click to vote

ML
2008
ACM

110views Machine Learning» more ML 2008»

A theory of learning with similarity functions

13 years 7 months ago

Download www.cs.cmu.edu

Kernel functions have become an extremely popular tool in machine learning, with an attractive theory as well. This theory views a kernel as implicitly mapping data points into a ...

Maria-Florina Balcan, Avrim Blum, Nathan Srebro

claim paper

Read More »

click to vote

ICML
2008
IEEE

165views Machine Learning» more ICML 2008»

A worst-case comparison between temporal difference and residual gradient with linear function approximation

14 years 9 months ago

Download www.research.rutgers.edu

Residual gradient (RG) was proposed as an alternative to TD(0) for policy evaluation when function approximation is used, but there exists little formal analysis comparing them ex...

Lihong Li

claim paper

Read More »

« Prev « First page 11 / 72 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers