Search Sciweavers | Sciweavers

92 search results - page 12 / 19

» A General Convergence Method for Reinforcement Learning in t...

JMLR
2010

Dual Averaging Methods for Regularized Stochastic Learning and Online Optimization

13 years 1 months ago

Download jmlr.csail.mit.edu

We consider regularized stochastic learning and online optimization problems, where the objective function is the sum of two convex terms: one is the loss function of the learning...

Lin Xiao

GECCO
2009
Springer

Apply ant colony optimization to Tetris

14 years 1 months ago

Download cs.nju.edu.cn

Tetris is a falling block game where the player’s objective is to arrange a sequence of different shaped tetrominoes smoothly in order to survive. In the intelligence games, ag...

Xingguo Chen, Hao Wang, Weiwei Wang, Yinghuan Shi,...

ICML
2000
IEEE

Rates of Convergence for Variable Resolution Schemes in Optimal Control

14 years 7 months ago

Download sequel.futurs.inria.fr

This paper presents a general method to derive tight rates of convergence for numerical approximations in optimal control when we consider variable resolution grids. We study the ...

Andrew W. Moore, Rémi Munos

JMLR
2010

Bundle Methods for Regularized Risk Minimization

13 years 5 months ago

Download www.stat.purdue.edu

A wide variety of machine learning problems can be described as minimizing a regularized risk functional, with different algorithms using different notions of risk and differen...

Choon Hui Teo, S. V. N. Vishwanathan, Alex J. Smol...

GECCO
2007
Springer

Adaptive variance scaling in continuous multi-objective estimation-of-distribution algorithms

14 years 26 days ago

Download www.cs.bham.ac.uk

Recent research into single-objective continuous Estimation-of-Distribution Algorithms (EDAs) has shown that when maximum-likelihood estimations are used for parametric d...

Peter A. N. Bosman, Dirk Thierens

