Sciweavers

373 search results - page 41 / 75
» Building Relational World Models for Reinforcement Learning
Sort
View
IAT
2010
IEEE
13 years 5 months ago
Selecting Operator Queries Using Expected Myopic Gain
When its human operator cannot continuously supervise (much less teleoperate) an agent, the agent should be able to recognize its limitations and ask for help when it risks making...
Robert Cohn, Michael Maxim, Edmund H. Durfee, Sati...
AAAI
2008
13 years 10 months ago
Transfer Learning via Dimensionality Reduction
Transfer learning addresses the problem of how to utilize plenty of labeled data in a source domain to solve related but different problems in a target domain, even when the train...
Sinno Jialin Pan, James T. Kwok, Qiang Yang
ICANN
2010
Springer
13 years 8 months ago
A Bilinear Model for Consistent Topographic Representations
Visual recognition faces the difficult problem of recognizing objects despite the multitude of their appearances. Ample neuroscientific evidence shows that the cortex uses a topogr...
Urs Bergmann, Christoph von der Malsburg
EMNLP
2011
12 years 7 months ago
Watermarking the Outputs of Structured Prediction with an application in Statistical Machine Translation
We propose a general method to watermark and probabilistically identify the structured outputs of machine learning algorithms. Our method is robust to local editing operations and...
Ashish Venugopal, Jakob Uszkoreit, David Talbot, F...
CHI
2009
ACM
14 years 8 months ago
EnsembleMatrix: interactive visualization to support machine learning with multiple classifiers
Machine learning is an increasingly used computational tool within human-computer interaction research. While most researchers currently utilize an iterative approach to refining ...
Justin Talbot, Bongshin Lee, Ashish Kapoor, Desney...