Search Sciweavers | Sciweavers

86 search results - page 14 / 18

» Estimation and Approximation Bounds for Gradient-Based Reinf...

203

click to vote

ICML
2008
IEEE

117views Machine Learning» more ICML 2008»

Sample-based learning and search with permanent and transient memories

16 years 7 months ago

Download www.cs.ualberta.ca

We present a reinforcement learning architecture, Dyna-2, that encompasses both samplebased learning and sample-based search, and that generalises across states during both learni...

David Silver, Martin Müller 0003, Richard S. ...

claim paper

Read More »

193

Voted

ICML
2004
IEEE

134views Machine Learning» more ICML 2004»

Approximate inference by Markov chains on union spaces

16 years 7 months ago

Download www.ics.uci.edu

A standard method for approximating averages in probabilistic models is to construct a Markov chain in the product space of the random variables with the desired equilibrium distr...

Max Welling, Michal Rosen-Zvi, Yee Whye Teh

claim paper

Read More »

200

click to vote

NIPS
1998

164views Information Technology» more NIPS 1998»

Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms

15 years 8 months ago

Download www.cis.upenn.edu

In this paper, we address two issues of long-standing interest in the reinforcement learning literature. First, what kinds of performance guarantees can be made for Q-learning aft...

Michael J. Kearns, Satinder P. Singh

claim paper

Read More »

203

click to vote

SODA
2008
ACM

184views Algorithms» more SODA 2008»

Coresets, sparse greedy approximation, and the Frank-Wolfe algorithm

15 years 8 months ago

Download www.almaden.ibm.com

The problem of maximizing a concave function f(x) in a simplex S can be solved approximately by a simple greedy algorithm. For given k, the algorithm can find a point x(k) on a k-...

Kenneth L. Clarkson

claim paper

Read More »

223

click to vote

TNN
2010

216views Management» more TNN 2010»

Simplifying mixture models through function approximation

15 years 1 months ago

Download books.nips.cc

Finite mixture model is a powerful tool in many statistical learning problems. In this paper, we propose a general, structure-preserving approach to reduce its model complexity, w...

Kai Zhang, James T. Kwok

claim paper

Read More »

« Prev « First page 14 / 18 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers