Sciweavers

54 search results - page 7 / 11
» Second-order Learning Algorithm with Squared Penalty Term
Sort
View
ATAL
2009
Springer
14 years 1 months ago
Online exploration in least-squares policy iteration
One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...
Lihong Li, Michael L. Littman, Christopher R. Mans...
GECCO
2008
Springer
128views Optimization» more  GECCO 2008»
13 years 7 months ago
Adapted Pittsburgh classifier system: building accurate strategies in non markovian environments
This paper focuses on the study of the behavior of a genetic algorithm based classifier system, the Adapted Pittsburgh Classifier System (A.P.C.S), on maze type environments con...
Gilles Énée, Mathias Péroumal...
ALT
2009
Springer
14 years 3 months ago
Complexity versus Agreement for Many Views
Abstract. The paper considers the problem of semi-supervised multiview classification, where each view corresponds to a Reproducing Kernel Hilbert Space. An algorithm based on co-...
Odalric-Ambrym Maillard, Nicolas Vayatis
ICML
2006
IEEE
14 years 7 months ago
Convex optimization techniques for fitting sparse Gaussian graphical models
We consider the problem of fitting a large-scale covariance matrix to multivariate Gaussian data in such a way that the inverse is sparse, thus providing model selection. Beginnin...
Onureena Banerjee, Laurent El Ghaoui, Alexandre d'...
PAMI
2010
225views more  PAMI 2010»
13 years 1 months ago
Semi-Supervised Classification via Local Spline Regression
Abstract--This paper presents local spline regression for semisupervised classification. The core idea in our approach is to introduce splines developed in Sobolev space to map the...
Shiming Xiang, Feiping Nie, Changshui Zhang