Sciweavers

ML
2002
ACM
13 years 8 months ago
Metric-Based Methods for Adaptive Model Selection and Regularization
We present a general approach to model selection and regularization that exploits unlabeled data to adaptively control hypothesis complexity in supervised learning tasks. The idea ...
Dale Schuurmans, Finnegan Southey
SAC
2006
ACM
13 years 9 months ago
Combining supervised and unsupervised monitoring for fault detection in distributed computing systems
Fast and accurate fault detection is becoming an essential component of management software for mission-critical systems. A good fault detector makes it possible to initiate repair a...
Haifeng Chen, Guofei Jiang, Cristian Ungureanu, Ke...
CORR
2010
Springer
13 years 9 months ago
Neuroevolutionary optimization
Temporal difference methods are theoretically grounded and empirically effective methods for addressing reinforcement learning problems. In most real-world reinforcement learning ...
Eva Volná
PREMI
2007
Springer
14 years 3 months ago
Self Adaptable Recognizer for Document Image Collections
This paper presents an architecture that enables the recognizer to learn incrementally and thereby adapt to document image collections for performance improvement. We ar...
Million Meshesha, C. V. Jawahar
NIPS
1996
13 years 10 months ago
Second-order Learning Algorithm with Squared Penalty Term
This paper compares three penalty terms with respect to the efficiency of supervised learning, by using first- and second-order learning algorithms. Our experiments showed that fo...
Kazumi Saito, Ryohei Nakano