Sciweavers

236 search results - page 15 / 48
» Bias and Variance Approximation in Value Function Estimates
Sort
View
TR
2010
128views Hardware» more  TR 2010»
13 years 3 months ago
Strategy for Planning Accelerated Life Tests With Small Sample Sizes
Previous work on planning accelerated life tests has been based on large-sample approximations to evaluate test plan properties. In this paper, we use more accurate simulation met...
Haiming Ma, William Q. Meeker
NIPS
2008
13 years 10 months ago
Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms
Policy gradient (PG) reinforcement learning algorithms have strong (local) convergence guarantees, but their learning performance is typically limited by a large variance in the e...
John W. Roberts, Russ Tedrake
TOMACS
2010
79views more  TOMACS 2010»
13 years 3 months ago
A stochastic approximation method with max-norm projections and its applications to the Q-learning algorithm
In this paper, we develop a stochastic approximation method to solve a monotone estimation problem and use this method to enhance the empirical performance of the Q-learning algor...
Sumit Kunnumkal, Huseyin Topaloglu
ICML
2002
IEEE
14 years 9 months ago
Non-Disjoint Discretization for Naive-Bayes Classifiers
Previous discretization techniques have discretized numeric attributes into disjoint intervals. We argue that this is neither necessary nor appropriate for naive-Bayes classifiers...
Ying Yang, Geoffrey I. Webb
CDC
2010
IEEE
110views Control Systems» more  CDC 2010»
13 years 3 months ago
Multi-step-ahead multivariate predictors: A comparative analysis
Abstract-- The focus of this article is to undertake a comparative analysis of multi-step-ahead linear multivariate predictors. The approach considered for the estimation will be b...
Marzia Cescon, Rolf Johansson