Sciweavers

2175 search results - page 14 / 435
» Model-free Learning from Demonstration
CORR
2006
Springer
13 years 7 months ago
Evaluating the Robustness of Learning from Implicit Feedback
This paper evaluates the robustness of learning from implicit feedback in web search. In particular, we create a model of user behavior by drawing upon user studies in laboratory ...
Filip Radlinski, Thorsten Joachims
ICML
2009
IEEE
14 years 8 months ago
Deep learning from temporal coherence in video
This work proposes a learning method for deep architectures that takes advantage of sequential data, in particular from the temporal coherence that naturally exists in unlabeled v...
Hossein Mobahi, Ronan Collobert, Jason Weston
WWW
2004
ACM
14 years 8 months ago
Dealing with different distributions in learning from
In the problem of learning with positive and unlabeled examples, existing research all assumes that positive examples P and the hidden positive examples in the unlabeled set U are...
Xiaoli Li, Bing Liu
ECAI
2006
Springer
13 years 11 months ago
Learning by Automatic Option Discovery from Conditionally Terminating Sequences
This paper proposes a novel approach to discover options in the form of conditionally terminating sequences, and shows how they can be integrated into reinforcement learn...
Sertan Girgin, Faruk Polat, Reda Alhajj
JMLR
2010
13 years 2 months ago
Reducing Label Complexity by Learning From Bags
We consider a supervised learning setting in which the main cost of learning is the number of training labels and one can obtain a single label for a bag of examples, indicating o...
Sivan Sabato, Nathan Srebro, Naftali Tishby