Sciweavers

688 search results - page 100 / 138
» Using reinforcement learning to adapt an imitation task
SDM
2011
SIAM
233 views, Data Mining
12 years 10 months ago
Multi-Instance Mixture Models
Multi-instance (MI) learning is a variant of supervised learning where labeled examples consist of bags (i.e. multi-sets) of feature vectors instead of just a single feature vecto...
James R. Foulds, Padhraic Smyth
ICML
1996
IEEE
14 years 8 months ago
Learning Evaluation Functions for Large Acyclic Domains
Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...
Justin A. Boyan, Andrew W. Moore
KDD
2007
ACM
178 views, Data Mining
14 years 8 months ago
Real-time ranking with concept drift using expert advice
In many practical applications, one is interested in generating a ranked list of items using information mined from continuous streams of data. For example, in the context of comp...
Hila Becker, Marta Arias
FLAIRS
2006
13 years 9 months ago
Robot Navigation Using Integrated Retrieval of Behaviors and Routes
RUPART is a hybrid robot control system for navigating a real-world, academic building. Hybrid robot control systems provide robust low-level navigation together with strategic p...
Susan Eileen Fox, Peter Anderson-Sprecher
DAGM
2011
Springer
12 years 7 months ago
Agnostic Domain Adaptation
The supervised learning paradigm assumes in general that both training and test data are sampled from the same distribution. When this assumption is violated, we are in the setting...
Alexander Vezhnevets, Joachim M. Buhmann