Search Sciweavers | Sciweavers

340 search results - page 42 / 68

» Kernelized value function approximation for reinforcement le...

142

click to vote

JCP
2007

143views more JCP 2007»

Noisy K Best-Paths for Approximate Dynamic Programming with Application to Portfolio Optimization

15 years 2 months ago

Download www.academypublisher.com

Abstract— We describe a general method to transform a non-Markovian sequential decision problem into a supervised learning problem using a K-bestpaths algorithm. We consider an a...

Nicolas Chapados, Yoshua Bengio

claim paper

Read More »

124

click to vote

ICML
2007
IEEE

139views Machine Learning» more ICML 2007»

Learning state-action basis functions for hierarchical MDPs

16 years 3 months ago

Download www.machinelearning.org

This paper introduces a new approach to actionvalue function approximation by learning basis functions from a spectral decomposition of the state-action manifold. This paper exten...

Sarah Osentoski, Sridhar Mahadevan

claim paper

Read More »

120

click to vote

JC
2008

128views more JC 2008»

Lattice rule algorithms for multivariate approximation in the average case setting

15 years 2 months ago

Download www.maths.unsw.edu.au

We study multivariate approximation for continuous functions in the average case setting. The space of d variate continuous functions is equipped with the zero mean Gaussian measu...

Frances Y. Kuo, Ian H. Sloan, Henryk Wozniakowski

claim paper

Read More »

130

Voted

ATAL
2009
Springer

146views Intelligent Agents» more ATAL 2009»

Online exploration in least-squares policy iteration

15 years 9 months ago

Download www.aamas-conference.org

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

claim paper

Read More »

131

Voted

ICCV
2009
IEEE

325views Computer Vision» more ICCV 2009»

Bayesian Poisson regression for crowd counting

15 years 7 days ago

Download www.svcl.ucsd.edu

Poisson regression models the noisy output of a counting function as a Poisson random variable, with a log-mean parameter that is a linear function of the input vector. In this wo...

Antoni B. Chan, Nuno Vasconcelos

claim paper

Read More »

« Prev « First page 42 / 68 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers