Sciweavers

81 search results - page 7 / 17
JAIR
2008
13 years 8 months ago
Graphical Model Inference in Optimal Control of Stochastic Multi-Agent Systems
In this article we consider the issue of optimal control in collaborative multi-agent systems with stochastic dynamics. The agents have a joint task in which they have to reach a ...
Bart van den Broek, Wim Wiegerinck, Bert Kappen
MOR
2007
13 years 8 months ago
Adaptive Control Variates for Finite-Horizon Simulation
Adaptive Monte Carlo methods are simulation efficiency improvement techniques designed to adaptively tune simulation estimators. Most of the work on adaptive Monte Carlo methods h...
Sujin Kim, Shane G. Henderson
ICML
2000
IEEE
14 years 9 months ago
Reinforcement Learning in POMDP's via Direct Gradient Ascent
This paper discusses theoretical and experimental aspects of gradient-based approaches to the direct optimization of policy performance in controlled POMDPs. We introduce GPOMDP, a...
Jonathan Baxter, Peter L. Bartlett
SODA
2010
ACM
14 years 6 months ago
How good is the Chord algorithm?
The Chord algorithm is a popular, simple method for the succinct approximation of curves, which is widely used, under different names, in a variety of areas, such as, multiobjecti...
Constantinos Daskalakis, Ilias Diakonikolas, Mihal...
HYBRID
2007
Springer
14 years 2 months ago
Robust, Optimal Predictive Control of Jump Markov Linear Systems Using Particles
Hybrid discrete-continuous models, such as Jump Markov Linear Systems, are convenient tools for representing many real-world systems; in the case of fault detection, discrete jumps...
Lars Blackmore, Askar Bektassov, Masahiro Ono, Bri...