Sciweavers

1106 search results - page 163 / 222
» On regularization algorithms in learning theory
Sort
View
COLT
2005
Springer
15 years 10 months ago
Ranking and Scoring Using Empirical Risk Minimization
A general model is proposed for studying ranking problems. We investigate learning methods based on empirical minimization of the natural estimates of the ranking risk. The empiric...
Stéphan Clémençon, Gáb...
ICRA
2003
IEEE
123views Robotics» more  ICRA 2003»
15 years 9 months ago
Autonomous reactive control for simulated humanoids
— We present a framework for composing motor controllers into autonomous composite reactive behaviors for bipedal robots and autonomous, physically-simulated humanoids. A key con...
Petros Faloutsos, Michiel van de Panne, Demetri Te...
COLT
2000
Springer
15 years 8 months ago
Bias-Variance Error Bounds for Temporal Difference Updates
We give the first rigorous upper bounds on the error of temporal difference (td) algorithms for policy evaluation as a function of the amount of experience. These upper bounds pr...
Michael J. Kearns, Satinder P. Singh
NIPS
1994
15 years 5 months ago
Combining Estimators Using Non-Constant Weighting Functions
This paper discusses the linearly weighted combination of estimators in which the weighting functions are dependent on the input. We show that the weighting functions can be deriv...
Volker Tresp, Michiaki Taniguchi
ICML
2010
IEEE
15 years 5 months ago
Supervised Aggregation of Classifiers using Artificial Prediction Markets
Prediction markets are used in real life to predict outcomes of interest such as presidential elections. In this work we introduce a mathematical theory for Artificial Prediction ...
Nathan Lay, Adrian Barbu