Search Sciweavers | Sciweavers

190 search results - page 28 / 38

» An Incremental Sampling-based Algorithm for Stochastic Optim...

ATAL
2003
Springer

185 views, Intelligent Agents

Optimizing information exchange in cooperative multi-agent systems

15 years 12 months ago

Download rbr.cs.umass.edu

Decentralized control of a cooperative multi-agent system is the problem faced by multiple decision-makers that share a common set of objectives. The decision-makers may be robots...

Claudia V. Goldman, Shlomo Zilberstein

GECCO
2007
Springer

164 views, Optimization

A study of mutational robustness as the product of evolutionary computation

16 years 25 days ago

Download www.cs.bham.ac.uk

This paper investigates the ability of a tournament selection based genetic algorithm to find mutationally robust solutions to a simple combinatorial optimization problem. Two di...

Justin Schonfeld

CDC
2009
IEEE

111 views, Control Systems

On fusion of information from multiple sensors in the presence of analog erasure links

15 years 11 months ago

Download ee.nd.edu

Consider multiple sensors that transmit data over analog erasure links to an estimation center. The sensors have access to distinct entries of the output vector of a linear and...

Vijay Gupta, Nuno C. Martins

NIPS
2001

144 views, Information Technology

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 8 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

CDC
2010
IEEE

139 views, Control Systems

Q-learning and enhanced policy iteration in discounted dynamic programming

15 years 1 month ago

Download web.mit.edu

We consider the classical finite-state discounted Markovian decision problem, and we introduce a new policy iteration-like algorithm for finding the optimal state costs or Q-facto...

Dimitri P. Bertsekas, Huizhen Yu
