Sciweavers

12753 search results - page 2211 / 2551
» is 2002
Sort
View
ML
2002
ACM
133views Machine Learning» more  ML 2002»
15 years 4 months ago
Finite-time Analysis of the Multiarmed Bandit Problem
Reinforcement learning policies face the exploration versus exploitation dilemma, i.e. the search for a balance between exploring the environment to find profitable actions while t...
Peter Auer, Nicolò Cesa-Bianchi, Paul Fisch...
ML
2002
ACM
133views Machine Learning» more  ML 2002»
15 years 4 months ago
Estimating Generalization Error on Two-Class Datasets Using Out-of-Bag Estimates
For two-class datasets, we provide a method for estimating the generalization error of a bag using out-of-bag estimates. In bagging, each predictor (single hypothesis) is learned ...
Tom Bylander
ML
2002
ACM
135views Machine Learning» more  ML 2002»
15 years 4 months ago
Bayesian Treed Models
When simple parametric models such as linear regression fail to adequately approximate a relationship across an entire set of data, an alternative may be to consider a partition o...
Hugh A. Chipman, Edward I. George, Robert E. McCul...
163
Voted
ML
2002
ACM
163views Machine Learning» more  ML 2002»
15 years 4 months ago
Structural Modelling with Sparse Kernels
A widely acknowledged drawback of many statistical modelling techniques, commonly used in machine learning, is that the resulting model is extremely difficult to interpret. A numb...
Steve R. Gunn, Jaz S. Kandola
ML
2002
ACM
143views Machine Learning» more  ML 2002»
15 years 4 months ago
A Sparse Sampling Algorithm for Near-Optimal Planning in Large Markov Decision Processes
An issue that is critical for the application of Markov decision processes MDPs to realistic problems is how the complexity of planning scales with the size of the MDP. In stochas...
Michael J. Kearns, Yishay Mansour, Andrew Y. Ng
« Prev « First page 2211 / 2551 Last » Next »