Sciweavers

190 search results - page 28 / 38
ATAL
2003
Springer
14 years 1 month ago
Optimizing information exchange in cooperative multi-agent systems
Decentralized control of a cooperative multi-agent system is the problem faced by multiple decision-makers that share a common set of objectives. The decision-makers may be robots...
Claudia V. Goldman, Shlomo Zilberstein
GECCO
2007
Springer
14 years 2 months ago
A study of mutational robustness as the product of evolutionary computation
This paper investigates the ability of a tournament selection based genetic algorithm to find mutationally robust solutions to a simple combinatorial optimization problem. Two di...
Justin Schonfeld
CDC
2009
IEEE
14 years 1 month ago
On fusion of information from multiple sensors in the presence of analog erasure links
Consider multiple sensors that transmit data over analog erasure links to an estimation center. The sensors have access to distinct entries of the output vector of a linear and...
Vijay Gupta, Nuno C. Martins
NIPS
2001
13 years 10 months ago
Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning
Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...
Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...
CDC
2010
IEEE
13 years 3 months ago
Q-learning and enhanced policy iteration in discounted dynamic programming
We consider the classical finite-state discounted Markovian decision problem, and we introduce a new policy iteration-like algorithm for finding the optimal state costs or Q-facto...
Dimitri P. Bertsekas, Huizhen Yu