Search Sciweavers | Sciweavers

86 search results - page 11 / 18

» Estimation and Approximation Bounds for Gradient-Based Reinf...

179

click to vote

GECCO
2006
Springer

177views Optimization» more GECCO 2006»

Hyper-ellipsoidal conditions in XCS: rotation, linear approximation, and solution structure

15 years 10 months ago

Download www.eskimo.com

The learning classifier system XCS is an iterative rulelearning system that evolves rule structures based on gradient-based prediction and rule quality estimates. Besides classifi...

Martin V. Butz, Pier Luca Lanzi, Stewart W. Wilson

claim paper

Read More »

180

click to vote

DAM
2007

84views more DAM 2007»

Estimates of covering numbers of convex sets with slowly decaying orthogonal subsets

15 years 7 months ago

Download www.dist.unige.it

Covering numbers of precompact symmetric convex subsets of Hilbert spaces are investigated. Lower bounds are derived for sets containing orthogonal subsets with norms of their ele...

Vera Kurková, Marcello Sanguineti

claim paper

Read More »

165

click to vote

COLT
2006
Springer

100views Machine Learning» more COLT 2006»

Unifying Divergence Minimization and Statistical Inference Via Convex Duality

15 years 10 months ago

Download ttic.uchicago.edu

Abstract. In this paper we unify divergence minimization and statistical inference by means of convex duality. In the process of doing so, we prove that the dual of approximate max...

Yasemin Altun, Alexander J. Smola

claim paper

Read More »

220

click to vote

PKDD
2009
Springer

184views Data Mining» more PKDD 2009»

Boosting Active Learning to Optimality: A Tractable Monte-Carlo, Billiard-Based Algorithm

15 years 11 months ago

Download www.lri.fr

Abstract. This paper focuses on Active Learning with a limited number of queries; in application domains such as Numerical Engineering, the size of the training set might be limite...

Philippe Rolet, Michèle Sebag, Olivier Teyt...

claim paper

Read More »

225

click to vote

CORR
2010
Springer

204views Education» more CORR 2010»

Predictive State Temporal Difference Learning

15 years 5 months ago

Download www.cs.cmu.edu

We propose a new approach to value function approximation which combines linear temporal difference reinforcement learning with subspace identiﬁcation. In practical applications...

Byron Boots, Geoffrey J. Gordon

claim paper

Read More »

« Prev « First page 11 / 18 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers