Sciweavers

ICML
2000
IEEE
15 years 15 days ago
Combining Reinforcement Learning with a Local Control Algorithm
We explore combining reinforcement learning with a hand-crafted local controller in a manner suggested by the chaotic control algorithm of Vincent, Schmitt and Vincent (1994). A c...
Andrew G. Barto, Jette Randløv, Michael T. ...
ICML
2000
IEEE
15 years 15 days ago
Eligibility Traces for Off-Policy Policy Evaluation
Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...
Doina Precup, Richard S. Sutton, Satinder P. Singh
ICML
2000
IEEE
15 years 15 days ago
Constructive Feature Learning and the Development of Visual Expertise
We present a framework for learning features for visual discrimination. The learning system is exposed to a sequence of training images. Whenever it fails to recognize a visual co...
Justus H. Piater, Roderic A. Grupen
ICML
2000
IEEE
15 years 15 days ago
Meta-Learning by Landmarking Various Learning Algorithms
Landmarking is a novel approach to describing tasks in meta-learning. Previous approaches to meta-learning mostly considered only statistics-inspired measures of the data as a sou...
Bernhard Pfahringer, Hilan Bensusan, Christophe G....
ICML
2000
IEEE
15 years 15 days ago
Learning Probabilistic Models for Decision-Theoretic Navigation of Mobile Robots
Decision-theoretic reasoning and planning algorithms are increasingly being used for mobile robot navigation, due to the signi cant uncertainty accompanying the robots' perce...
Daniel Nikovski, Illah R. Nourbakhsh
ICML
2000
IEEE
15 years 15 days ago
Algorithms for Inverse Reinforcement Learning
Andrew Y. Ng, Stuart J. Russell
ICML
2000
IEEE
15 years 15 days ago
A Boosting Approach to Topic Spotting on Subdialogues
We report the results of a study on topic spotting in conversational speech. Using a machine learning approach, we build classifiers that accept an audio file of conversational hu...
Kary Myers, Michael J. Kearns, Satinder P. Singh, ...
ICML
2000
IEEE
15 years 15 days ago
Rates of Convergence for Variable Resolution Schemes in Optimal Control
This paper presents a general method to derive tight rates of convergence for numerical approximations in optimal control when we consider variable resolution grids. We study the ...
Andrew W. Moore, Rémi Munos