Sciweavers

» Learning for stochastic dynamic programming
NIPS
2003
15 years 3 months ago
Policy Search by Dynamic Programming
We consider the policy search approach to reinforcement learning. We show that if a “baseline distribution” is given (indicating roughly how often we expect a good policy to v...
J. Andrew Bagnell, Sham Kakade, Andrew Y. Ng, Jeff...
CPAIOR
2008
Springer
15 years 4 months ago
Amsaa: A Multistep Anticipatory Algorithm for Online Stochastic Combinatorial Optimization
The one-step anticipatory algorithm (1s-AA) is an online algorithm making decisions under uncertainty by ignoring future non-anticipativity constraints. It makes near-optimal decis...
Luc Mercier, Pascal Van Hentenryck
RECOMB
2006
Springer
16 years 2 months ago
Predicting Experimental Quantities in Protein Folding Kinetics Using Stochastic Roadmap Simulation
Abstract. This paper presents a new method for studying protein folding kinetics. It uses the recently introduced Stochastic Roadmap Simulation (SRS) method to estimate the transit...
Tsung-Han Chiang, Mehmet Serkan Apaydin, Douglas L...
HYBRID
2004
Springer
15 years 8 months ago
Inference Methods for Autonomous Stochastic Linear Hybrid Systems
We present a parameter inference algorithm for autonomous stochastic linear hybrid systems, which computes a maximum-likelihood model, given only a set of continuous output data of...
Hamsa Balakrishnan, Inseok Hwang, Jung Soon Jang, ...
ECAI
2010
Springer
15 years 3 months ago
EP for Efficient Stochastic Control with Obstacles
Abstract. We address the problem of continuous stochastic optimal control in the presence of hard obstacles. Due to the non-smooth character of the obstacles, the traditional appro...
Thomas Mensink, Jakob J. Verbeek, Bert Kappen