Sciweavers

134 search results - page 4 / 27
» Optimal Quadratic-Form Estimator of the Variance of the Samp...
Sort
View
JMLR
2006
143views more  JMLR 2006»
13 years 8 months ago
Geometric Variance Reduction in Markov Chains: Application to Value Function and Gradient Estimation
We study a sequential variance reduction technique for Monte Carlo estimation of functionals in Markov Chains. The method is based on designing sequential control variates using s...
Rémi Munos
NIPS
2001
13 years 10 months ago
Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning
Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...
Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...
ESANN
2003
13 years 10 months ago
Approximately unbiased estimation of conditional variance in heteroscedastic kernel ridge regression
In this paper we extend a form of kernel ridge regression for data characterised by a heteroscedastic noise process (introduced in Foxall et al. [1]) in order to provide approxima...
Gavin C. Cawley, Nicola L. C. Talbot, Robert J. Fo...
KDD
2009
ACM
227views Data Mining» more  KDD 2009»
14 years 9 months ago
Efficiently learning the accuracy of labeling sources for selective sampling
Many scalable data mining tasks rely on active learning to provide the most useful accurately labeled instances. However, what if there are multiple labeling sources (`oracles...
Pinar Donmez, Jaime G. Carbonell, Jeff Schneider
WSC
2007
13 years 11 months ago
Single-stage multiple-comparison procedure for quantiles and other parameters
We present a single-stage multiple-comparison procedure for comparing parameters of independent systems, where the parameters are not necessarily means or steady-state means. We a...
Marvin K. Nakayama