Sciweavers

508 search results - page 59 / 102
» Learning for stochastic dynamic programming
Sort
View
AAAI
2010
15 years 4 months ago
Relational Partially Observable MDPs
Relational Markov Decision Processes (MDP) are a useraction for stochastic planning problems since one can develop abstract solutions for them that are independent of domain size ...
Chenggang Wang, Roni Khardon
111
Voted
AAAI
1998
15 years 4 months ago
Using Caching to Solve Larger Probabilistic Planning Problems
Probabilistic planning algorithms seek e ective plans for large, stochastic domains. maxplan is a recently developed algorithm that converts a planning problem into an E-Majsat pr...
Stephen M. Majercik, Michael L. Littman
101
Voted
IDA
2007
Springer
15 years 8 months ago
Learning to Align: A Statistical Approach
We present a new machine learning approach to the inverse parametric sequence alignment problem: given as training examples a set of correct pairwise global alignments, find the p...
Elisa Ricci, Tijl De Bie, Nello Cristianini
117
Voted
IJCV
1998
163views more  IJCV 1998»
15 years 2 months ago
CONDENSATION - Conditional Density Propagation for Visual Tracking
The problem of tracking curves in dense visual clutter is challenging. Kalman filtering is inadequate because it is based on Gaussian densities which, being unimodal, cannot repre...
Michael Isard, Andrew Blake
123
Voted
CDC
2010
IEEE
160views Control Systems» more  CDC 2010»
14 years 9 months ago
Adaptive bases for Q-learning
Abstract-- We consider reinforcement learning, and in particular, the Q-learning algorithm in large state and action spaces. In order to cope with the size of the spaces, a functio...
Dotan Di Castro, Shie Mannor