Search Sciweavers | Sciweavers

25

JMLR
2010

149views more JMLR 2010»

Coherent Inference on Optimal Play in Game Trees

13 years 3 months ago

Round-based games are an instance of discrete planning problems. Some of the best contemporary game tree search algorithms use random roll-outs as data. Relying on a good policy, ...

Philipp Hennig, David H. Stern, Thore Graepel

claim paper

Read More »

32

click to vote

GECCO
2010
Springer

237views Optimization» more GECCO 2010»

Benchmarking the (1, 4)-CMA-ES with mirrored sampling and sequential selection on the noiseless BBOB-2010 testbed

14 years 1 months ago

Download www.lri.fr

The well-known Covariance Matrix Adaptation Evolution Strategy (CMA-ES) is a robust stochastic search algorithm for optimizing functions deﬁned on a continuous search space RD ....

Anne Auger, Dimo Brockhoff, Nikolaus Hansen

claim paper

Read More »

28

click to vote

AUTOMATICA
2006

104views more AUTOMATICA 2006»

Identification of multi-input systems: variance analysis and input design issues

13 years 9 months ago

Download infoscience.epfl.ch

This paper examines the identification of multi-input systems. Motivated by an experiment design problem (should one excite the various inputs simultaneously or separately), we ex...

Michel Gevers, Ljubisa Miskovic, Dominique Bonvin,...

claim paper

Read More »

40

click to vote

ATAL
2005
Springer

181views Intelligent Agents» more ATAL 2005»

Improving reinforcement learning function approximators via neuroevolution

14 years 2 months ago

Download www.aaai.org

Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...

Shimon Whiteson

claim paper

Read More »

28

click to vote

ICASSP
2010
IEEE

224views Signal Processing» more ICASSP 2010»

Distributed learning in cognitive radio networks: Multi-armed bandit with distributed multiple players

13 years 9 months ago

Download www.ece.ucdavis.edu

—We consider a cognitive radio network with distributed multiple secondary users, where each user independently searches for spectrum opportunities in multiple channels without e...

Keqin Liu, Qing Zhao

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers