Search Sciweavers | Sciweavers

995 search results - page 9 / 199

» Learning Useful Horn Approximations

click to vote

IJCAI
2003

147views Artificial Intelligence» more IJCAI 2003»

Approximate Policy Iteration using Large-Margin Classifiers

13 years 8 months ago

Download ijcai.org

We present an approximate policy iteration algorithm that uses rollouts to estimate the value of each action under a given policy in a subset of states and a classifier to general...

Michail G. Lagoudakis, Ronald Parr

claim paper

Read More »

click to vote

ICML
1995
IEEE

184views Machine Learning» more ICML 1995»

Residual Algorithms: Reinforcement Learning with Function Approximation

14 years 8 months ago

Download www.leemon.com

A number of reinforcement learning algorithms have been developed that are guaranteed to converge to the optimal solution when used with lookup tables. It is shown, however, that ...

Leemon C. Baird III

claim paper

Read More »

click to vote

NIPS
1998

164views Information Technology» more NIPS 1998»

Approximate Learning of Dynamic Models

13 years 8 months ago

Download robotics.stanford.edu

Inference is a key component in learning probabilistic models from partially observable data. When learning temporal models, each of the many inference phases requires a complete ...

Xavier Boyen, Daphne Koller

claim paper

Read More »

click to vote

ICML
2007
IEEE

180views Machine Learning» more ICML 2007»

Tracking value function dynamics to improve reinforcement learning with piecewise linear function approximation

14 years 8 months ago

Download www.machinelearning.org

Reinforcement learning algorithms can become unstable when combined with linear function approximation. Algorithms that minimize the mean-square Bellman error are guaranteed to co...

Chee Wee Phua, Robert Fitch

claim paper

Read More »

click to vote

ML
2012
ACM

385views Machine Learning» more ML 2012»

An alternative view of variational Bayes and asymptotic approximations of free energy

12 years 3 months ago

Download hawaii.naist.jp

Bayesian learning, widely used in many applied data-modeling problems, is often accomplished with approximation schemes because it requires intractable computation of the posterio...

Kazuho Watanabe

claim paper

Read More »

« Prev « First page 9 / 199 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers