Machine Learning | Sciweavers

107

ICML
2000
IEEE

126views Machine Learning» more ICML 2000»

Reinforcement Learning in POMDP's via Direct Gradient Ascent

16 years 3 months ago

This paper discusses theoretical and experimental aspects of gradient-based approaches to the direct optimization of policy performance in controlled ??? ?s. We introduce ??? ?, a...

Jonathan Baxter, Peter L. Bartlett

claim paper

Read More »

110

click to vote

ICML
2001
IEEE

145views Machine Learning» more ICML 2001»

Symmetry in Markov Decision Processes and its Implications for Single Agent and Multiagent Learning

16 years 3 months ago

Download www-2.cs.cmu.edu

This paper examines the notion of symmetry in Markov decision processes (MDPs). We define symmetry for an MDP and show how it can be exploited for more effective learning in singl...

Martin Zinkevich, Tucker R. Balch

claim paper

Read More »

133

click to vote

ICML
2001
IEEE

203views Machine Learning» more ICML 2001»

Obtaining calibrated probability estimates from decision trees and naive Bayesian classifiers

16 years 3 months ago

Download cseweb.ucsd.edu

Accurate, well-calibrated estimates of class membership probabilities are needed in many supervised learning applications, in particular when a cost-sensitive decision must be mad...

Bianca Zadrozny, Charles Elkan

claim paper

Read More »

129

click to vote

ICML
2001
IEEE

188views Machine Learning» more ICML 2001»

Feature selection for high-dimensional genomic microarray data

16 years 3 months ago

Download www.nslij-genetics.org

We report on the successful application of feature selection methods to a classification problem in molecular biology involving only 72 data points in a 7130 dimensional space. Ou...

Eric P. Xing, Michael I. Jordan, Richard M. Karp

claim paper

Read More »

143

click to vote

ICML
2001
IEEE

149views Machine Learning» more ICML 2001»

Constrained K-means Clustering with Background Knowledge

16 years 3 months ago

Download www.litech.org

Clustering is traditionally viewed as an unsupervised method for data analysis. However, in some cases information about the problem domain is available in addition to the data in...

Kiri Wagstaff, Claire Cardie, Seth Rogers, Stefan ...

claim paper

Read More »

121

click to vote

ICML
2001
IEEE

130views Machine Learning» more ICML 2001»

Learning to Generate Fast Signal Processing Implementations

16 years 3 months ago

Download www.cs.cmu.edu

A single signal processing algorithm can be represented by many mathematically equivalent formulas. However, when these formulas are implemented in code and run on real machines, ...

Bryan Singer, Manuela M. Veloso

claim paper

Read More »

136

click to vote

ICML
2001
IEEE

159views Machine Learning» more ICML 2001»

Direct Policy Search using Paired Statistical Tests

16 years 3 months ago

Download www.autonlab.org

Direct policy search is a practical way to solve reinforcement learning problems involving continuous state and action spaces. The goal becomes finding policy parameters that maxi...

Malcolm J. A. Strens, Andrew W. Moore

claim paper

Read More »

142

click to vote

ICML
2001
IEEE

137views Machine Learning» more ICML 2001»

Smoothed Bootstrap and Statistical Data Cloning for Classifier Evaluation

16 years 3 months ago

Download sci2s.ugr.es

This work is concerned with the estimation of a classifier's accuracy. We first review some existing methods for error estimation, focusing on cross-validation and bootstrap,...

Gregory Shakhnarovich, Ran El-Yaniv, Yoram Baram

claim paper

Read More »

116

click to vote

ICML
2001
IEEE

143views Machine Learning» more ICML 2001»

Scaling Reinforcement Learning toward RoboCup Soccer

16 years 3 months ago

Download www.cs.utexas.edu

Peter Stone, Richard S. Sutton

claim paper