Search Sciweavers | Sciweavers

509 search results - page 28 / 102

» Using Learning for Approximation in Stochastic Processes

NIPS
1998

156 views, Information Technology

Learning Nonlinear Dynamical Systems Using an EM Algorithm

15 years 3 months ago

Download www.cs.nyu.edu

The Expectation-Maximization (EM) algorithm is an iterative procedure for maximum likelihood parameter estimation from data sets with missing or hidden variables [2]. It has been app...

Zoubin Ghahramani, Sam T. Roweis

ILP
2003
Springer

126 views, Automated Reasoning

Graph Kernels and Gaussian Processes for Relational Reinforcement Learning

15 years 7 months ago

Download dtai.cs.kuleuven.be

RRL is a relational reinforcement learning system based on Q-learning in relational state-action spaces. It aims to enable agents to learn how to act in an environment that has no ...

Thomas Gärtner, Kurt Driessens, Jan Ramon

NIPS
2008

175 views, Information Technology

Local Gaussian Process Regression for Real Time Online Model Learning

15 years 3 months ago

Download www.kyb.tuebingen.mpg.de

Learning in real-time applications, e.g., online approximation of the inverse dynamics model for model-based robot control, requires fast online regression techniques. Inspired by...

Duy Nguyen-Tuong, Matthias Seeger, Jan Peters

JAIR
2008

107 views

Planning with Durative Actions in Stochastic Domains

15 years 2 months ago

Download www.cs.washington.edu

Probabilistic planning problems are typically modeled as a Markov Decision Process (MDP). MDPs, while an otherwise expressive model, allow only for sequential, non-durative action...

Mausam, Daniel S. Weld

ICML
2006
IEEE

256 views, Machine Learning

Automatic basis function construction for approximate dynamic programming and reinforcement learning

15 years 8 months ago

Download www.ece.mcgill.ca

We address the problem of automatically constructing basis functions for linear approximation of the value function of a Markov Decision Process (MDP). Our work builds on results ...

Philipp W. Keller, Shie Mannor, Doina Precup
