Search Sciweavers | Sciweavers

508 search results - page 59 / 102

» Learning for stochastic dynamic programming

190

click to vote

AAAI
2010

180views Intelligent Agents» more AAAI 2010»

Relational Partially Observable MDPs

15 years 8 months ago

Download www.cs.tufts.edu

Relational Markov Decision Processes (MDP) are a useraction for stochastic planning problems since one can develop abstract solutions for them that are independent of domain size ...

Chenggang Wang, Roni Khardon

claim paper

Read More »

191

click to vote

AAAI
1998

120views Intelligent Agents» more AAAI 1998»

Using Caching to Solve Larger Probabilistic Planning Problems

15 years 8 months ago

Download www.cs.rutgers.edu

Probabilistic planning algorithms seek e ective plans for large, stochastic domains. maxplan is a recently developed algorithm that converts a planning problem into an E-Majsat pr...

Stephen M. Majercik, Michael L. Littman

claim paper

Read More »

169

click to vote

IDA
2007
Springer

106views Information Technology» more IDA 2007»

Learning to Align: A Statistical Approach

16 years 1 months ago

Download www.tijldebie.net

We present a new machine learning approach to the inverse parametric sequence alignment problem: given as training examples a set of correct pairwise global alignments, ﬁnd the p...

Elisa Ricci, Tijl De Bie, Nello Cristianini

claim paper

Read More »

199

Voted

IJCV
1998

163views more IJCV 1998»

CONDENSATION - Conditional Density Propagation for Visual Tracking

15 years 6 months ago

Download robotics.caltech.edu

The problem of tracking curves in dense visual clutter is challenging. Kalman ﬁltering is inadequate because it is based on Gaussian densities which, being unimodal, cannot repre...

Michael Isard, Andrew Blake

claim paper

Read More »

211

click to vote

CDC
2010
IEEE

160views Control Systems» more CDC 2010»

Adaptive bases for Q-learning

15 years 2 months ago

Download webee.technion.ac.il

Abstract-- We consider reinforcement learning, and in particular, the Q-learning algorithm in large state and action spaces. In order to cope with the size of the spaces, a functio...

Dotan Di Castro, Shie Mannor

claim paper

Read More »

« Prev « First page 59 / 102 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers