Sciweavers

113
Voted
ICML
1998
IEEE
16 years 3 months ago
Intra-Option Learning about Temporally Abstract Actions
tion Learning about Temporally Abstract Actions Richard S. Sutton Department of Computer Science University of Massachusetts Amherst, MA 01003-4610 rich@cs.umass.edu Doina Precup D...
Richard S. Sutton, Doina Precup, Satinder P. Singh
118
Voted
ICML
1998
IEEE
16 years 3 months ago
Heading in the Right Direction
Stochastic topological models, and hidden Markov models in particular, are a useful tool for robotic navigation and planning. In previous work we have shown how weak odometric dat...
Hagit Shatkay, Leslie Pack Kaelbling
135
Voted
ICML
1998
IEEE
16 years 3 months ago
Value Function Based Production Scheduling
Production scheduling, the problem of sequentially con guring a factory to meet forecasted demands, is a critical problem throughout the manufacturing industry. The requirement of...
Jeff G. Schneider, Justin A. Boyan, Andrew W. Moor...
93
Voted
ICML
1998
IEEE
16 years 3 months ago
Ridge Regression Learning Algorithm in Dual Variables
Craig Saunders, Alexander Gammerman, Volodya Vovk
133
Voted
ICML
1998
IEEE
16 years 3 months ago
RL-TOPS: An Architecture for Modularity and Re-Use in Reinforcement Learning
This paper introduces the RL-TOPs architecture for robot learning, a hybrid system combining teleo-reactive planning and reinforcement learning techniques. The aim of this system ...
Malcolm R. K. Ryan, Mark D. Pendrith
116
Voted
ICML
1998
IEEE
16 years 3 months ago
The Case against Accuracy Estimation for Comparing Induction Algorithms
We analyze critically the use of classi cation accuracy to compare classi ers on natural data sets, providing a thorough investigation using ROC analysis, standard machine learnin...
Foster J. Provost, Tom Fawcett, Ron Kohavi
97
Voted
ICML
1998
IEEE
16 years 3 months ago
A Randomized ANOVA Procedure for Comparing Performance Curves
Three factors are related in analyses of performance curves such as learning curves: the amount of training, the learning algorithm, and performance. Often we want to know whether...
Justus H. Piater, Paul R. Cohen, Xiaoqin Zhang, Mi...
91
Voted
ICML
1998
IEEE
16 years 3 months ago
An Analysis of Direct Reinforcement Learning in Non-Markovian Domains
Mark D. Pendrith, Michael McGarity
121
Voted
ICML
1998
IEEE
16 years 3 months ago
Q2: Memory-Based Active Learning for Optimizing Noisy Continuous Functions
This paper introduces a new algorithm, Q2, foroptimizingthe expected output ofamultiinput noisy continuous function. Q2 is designed to need only a few experiments, it avoids stron...
Andrew W. Moore, Jeff G. Schneider, Justin A. Boya...
99
Voted
ICML
1998
IEEE
16 years 3 months ago
A Case Study in the Use of Theory Revision in Requirements Validation
T. L. McCluskey, Margaret Mary West