Search Sciweavers | Sciweavers

377 search results - page 10 / 76

» Convergence of Stochastic Iterative Dynamic Programming Algo...

169

click to vote

UAI
2008

192views Artificial Intelligence» more UAI 2008»

Sparse Stochastic Finite-State Controllers for POMDPs

15 years 8 months ago

Download www.aaai.org

Bounded policy iteration is an approach to solving infinitehorizon POMDPs that represents policies as stochastic finitestate controllers and iteratively improves a controller by a...

Eric A. Hansen

claim paper

Read More »

171

click to vote

TOMACS
2010

79views more TOMACS 2010»

A stochastic approximation method with max-norm projections and its applications to the Q-learning algorithm

15 years 1 months ago

Download legacy.orie.cornell.edu

In this paper, we develop a stochastic approximation method to solve a monotone estimation problem and use this method to enhance the empirical performance of the Q-learning algor...

Sumit Kunnumkal, Huseyin Topaloglu

claim paper

Read More »

207

click to vote

CORR
2012
Springer

235views Education» more CORR 2012»

An Incremental Sampling-based Algorithm for Stochastic Optimal Control

14 years 2 months ago

Download www.mit.edu

Abstract— In this paper, we consider a class of continuoustime, continuous-space stochastic optimal control problems. Building upon recent advances in Markov chain approximation ...

Vu Anh Huynh, Sertac Karaman, Emilio Frazzoli

claim paper

Read More »

164

click to vote

ANOR
2005

81views more ANOR 2005»

Managing Stochastic, Finite Capacity, Multi-Project Systems through the Cross-Entropy Methodology

15 years 6 months ago

Download www.technion.ac.il

This paper addresses the problem of loading a finite capacity, stochastic (random) and dynamic multi-project system. The system is controlled by keeping a constant number of projec...

Izack Cohen, Boaz Golany, Avraham Shtub

claim paper

Read More »

220

click to vote

AAAI
2008

154views Intelligent Agents» more AAAI 2008»

An Efficient Motion Planning Algorithm for Stochastic Dynamic Systems with Constraints on Probability of Failure

15 years 9 months ago

Download groups.csail.mit.edu

When controlling dynamic systems, such as mobile robots in uncertain environments, there is a trade off between risk and reward. For example, a race car can turn a corner faster b...

Masahiro Ono, Brian C. Williams

claim paper

Read More »

« Prev « First page 10 / 76 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers