Sciweavers

236 search results - page 15 / 48
» Bias and Variance Approximation in Value Function Estimates
Sort
View
TR
2010
128views Hardware» more  TR 2010»
14 years 11 months ago
Strategy for Planning Accelerated Life Tests With Small Sample Sizes
Previous work on planning accelerated life tests has been based on large-sample approximations to evaluate test plan properties. In this paper, we use more accurate simulation met...
Haiming Ma, William Q. Meeker
NIPS
2008
15 years 5 months ago
Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms
Policy gradient (PG) reinforcement learning algorithms have strong (local) convergence guarantees, but their learning performance is typically limited by a large variance in the e...
John W. Roberts, Russ Tedrake
TOMACS
2010
79views more  TOMACS 2010»
14 years 11 months ago
A stochastic approximation method with max-norm projections and its applications to the Q-learning algorithm
In this paper, we develop a stochastic approximation method to solve a monotone estimation problem and use this method to enhance the empirical performance of the Q-learning algor...
Sumit Kunnumkal, Huseyin Topaloglu
ICML
2002
IEEE
16 years 5 months ago
Non-Disjoint Discretization for Naive-Bayes Classifiers
Previous discretization techniques have discretized numeric attributes into disjoint intervals. We argue that this is neither necessary nor appropriate for naive-Bayes classifiers...
Ying Yang, Geoffrey I. Webb
CDC
2010
IEEE
110views Control Systems» more  CDC 2010»
14 years 11 months ago
Multi-step-ahead multivariate predictors: A comparative analysis
Abstract-- The focus of this article is to undertake a comparative analysis of multi-step-ahead linear multivariate predictors. The approach considered for the estimation will be b...
Marzia Cescon, Rolf Johansson