Search Sciweavers | Sciweavers

333 search results - page 48 / 67

» Optimal sampling from distributed streams

193

Voted

ICML
2000
IEEE

165views Machine Learning» more ICML 2000»

A Bayesian Framework for Reinforcement Learning

15 years 11 months ago

Download www.ece.uvic.ca

The reinforcement learning problem can be decomposed into two parallel types of inference: (i) estimating the parameters of a model for the underlying process; (ii) determining be...

Malcolm J. A. Strens

claim paper

Read More »

195

click to vote

CDC
2009
IEEE

156views Control Systems» more CDC 2009»

Input design using Markov chains for system identification

15 years 10 months ago

Download www.kth.se

This paper studies the input design problem for system identification where time domain constraints have to be considered. A finite Markov chain is used to model the input of the s...

Chiara Brighenti, Bo Wahlberg, Cristian R. Rojas

claim paper

Read More »

176

click to vote

HPCA
2007
IEEE

116views Distributed And Parallel Com...» more HPCA 2007»

Illustrative Design Space Studies with Microarchitectural Regression Models

16 years 7 months ago

Download www.eecs.harvard.edu

We apply a scalable approach for practical, comprehensive design space evaluation and optimization. This approach combines design space sampling and statistical inference to ident...

Benjamin C. Lee, David M. Brooks

claim paper

Read More »

156

Voted

GECCO
2005
Springer

129views Optimization» more GECCO 2005»

Real-coded crossover as a role of kernel density estimation

16 years 9 days ago

Download www.cs.bham.ac.uk

This paper presents a kernel density estimation method by means of real-coded crossovers. Estimation of density algorithms (EDAs) are evolutionary optimization techniques, which d...

Jun Sakuma, Shigenobu Kobayashi

claim paper

Read More »

196

click to vote

COLT
2010
Springer

191views Machine Learning» more COLT 2010»

Best Arm Identification in Multi-Armed Bandits

15 years 4 months ago

Download www.di.ens.fr

We consider the problem of finding the best arm in a stochastic multi-armed bandit game. The regret of a forecaster is here defined by the gap between the mean reward of the optim...

Jean-Yves Audibert, Sébastien Bubeck, R&eac...

claim paper

Read More »

« Prev « First page 48 / 67 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers