Search Sciweavers | Sciweavers

ML
2002
ACM

133 views | Machine Learning

Finite-time Analysis of the Multiarmed Bandit Problem

15 years 4 months ago

Reinforcement learning policies face the exploration versus exploitation dilemma, i.e. the search for a balance between exploring the environment to find profitable actions while t...

Peter Auer, Nicolò Cesa-Bianchi, Paul Fisch...

ML
2002
ACM

133 views | Machine Learning

Estimating Generalization Error on Two-Class Datasets Using Out-of-Bag Estimates

15 years 4 months ago

Download www.cs.utsa.edu

For two-class datasets, we provide a method for estimating the generalization error of a bag using out-of-bag estimates. In bagging, each predictor (single hypothesis) is learned ...

Tom Bylander

ML
2002
ACM

135 views | Machine Learning

Bayesian Treed Models

15 years 4 months ago

Download math.acadiau.ca

When simple parametric models such as linear regression fail to adequately approximate a relationship across an entire set of data, an alternative may be to consider a partition o...

Hugh A. Chipman, Edward I. George, Robert E. McCul...

ML
2002
ACM

163 views | Machine Learning

Structural Modelling with Sparse Kernels

15 years 4 months ago

Download users.ecs.soton.ac.uk

A widely acknowledged drawback of many statistical modelling techniques, commonly used in machine learning, is that the resulting model is extremely difficult to interpret. A numb...

Steve R. Gunn, Jaz S. Kandola

ML
2002
ACM

143 views | Machine Learning

A Sparse Sampling Algorithm for Near-Optimal Planning in Large Markov Decision Processes

15 years 4 months ago

Download www.cis.upenn.edu

An issue that is critical for the application of Markov decision processes (MDPs) to realistic problems is how the complexity of planning scales with the size of the MDP. In stochas...

Michael J. Kearns, Yishay Mansour, Andrew Y. Ng
