Sciweavers

ICML
2005
IEEE
14 years 8 months ago
Recycling data for multi-agent learning
Learning agents can improve performance cooperating with other agents, particularly learning agents forming a committee outperform individual agents. This "ensemble effect&qu...
Santiago Ontañón, Enric Plaza
ICML
2005
IEEE
14 years 8 months ago
High speed obstacle avoidance using monocular vision and reinforcement learning
We consider the task of driving a remote control car at high speeds through unstructured outdoor environments. We present an approach in which supervised learning is first used to...
Jeff Michels, Ashutosh Saxena, Andrew Y. Ng
ICML
2005
IEEE
14 years 8 months ago
Comparing clusterings: an axiomatic view
This paper views clusterings as elements of a lattice. Distances between clusterings are analyzed in their relationship to the lattice. From this vantage point, we first give an a...
Marina Meila
ICML
2005
IEEE
14 years 8 months ago
Bounded real-time dynamic programming: RTDP with monotone upper bounds and performance guarantees
MDPs are an attractive formalization for planning, but realistic problems often have intractably large state spaces. When we only need a partial policy to get from a fixed start s...
H. Brendan McMahan, Maxim Likhachev, Geoffrey J. G...
ICML
2005
IEEE
14 years 8 months ago
Proto-value functions: developmental reinforcement learning
This paper presents a novel framework called proto-reinforcement learning (PRL), based on a mathematical model of a proto-value function: these are task-independent basis function...
Sridhar Mahadevan
ICML
2005
IEEE
14 years 8 months ago
Modeling word burstiness using the Dirichlet distribution
Multinomial distributions are often used to model text documents. However, they do not capture well the phenomenon that words in a document tend to appear in bursts: if a word app...
Rasmus Elsborg Madsen, David Kauchak, Charles Elka...
ICML
2005
IEEE
14 years 8 months ago
ROC confidence bands: an empirical evaluation
This paper is about constructing confidence bands around ROC curves. We first introduce to the machine learning community three band-generating methods from the medical field, and...
Sofus A. Macskassy, Foster J. Provost, Saharon Ros...
ICML
2005
IEEE
14 years 8 months ago
Naive Bayes models for probability estimation
Naive Bayes models have been widely used for clustering and classification. However, they are seldom used for general probabilistic learning and inference (i.e., for estimating an...
Daniel Lowd, Pedro Domingos
ICML
2005
IEEE
14 years 8 months ago
Unsupervised evidence integration
Many biological propositions can be supported by a variety of different types of evidence. It is often useful to collect together large numbers of such propositions, together with...
Philip M. Long, Vinay Varadan, Sarah Gilman, Mark ...
ICML
2005
IEEE
14 years 8 months ago
Predicting protein folds with structural repeats using a chain graph model
Protein fold recognition is a key step towards inferring the tertiary structures from amino-acid sequences. Complex folds such as those consisting of interacting structural repeat...
Yan Liu, Eric P. Xing, Jaime G. Carbonell