Search Sciweavers | Sciweavers

Multiple data sources containing diﬀerent types of features may be available for a given task. For instance, users’ proﬁles can be used to build recommendation systems. In a...

Xiaoxiao Shi, Jean-François Paiement, David...

claim paper

Read More »

132

click to vote

COLT
2008
Springer

115views Machine Learning» more COLT 2008»

Competing in the Dark: An Efficient Algorithm for Bandit Linear Optimization

15 years 6 months ago

Download www-stat.wharton.upenn.edu

We introduce an efficient algorithm for the problem of online linear optimization in the bandit setting which achieves the optimal O ( T) regret. The setting is a natural general...

Jacob Abernethy, Elad Hazan, Alexander Rakhlin

claim paper

Read More »

141

click to vote

ICML
1996
IEEE

162views Machine Learning» more ICML 1996»

Sensitive Discount Optimality: Unifying Discounted and Average Reward Reinforcement Learning

16 years 5 months ago

Download reference.kfupm.edu.sa

Research in reinforcementlearning (RL)has thus far concentrated on two optimality criteria: the discounted framework, which has been very well-studied, and the averagereward frame...

Sridhar Mahadevan

claim paper

Read More »

164

click to vote

PPSN
2004
Springer

209views Distributed And Parallel Com...» more PPSN 2004»

Coupling of Evolution and Learning to Optimize a Hierarchical Object Recognition Model

15 years 9 months ago

Download www.techfak.uni-bielefeld.de

Abstract. A key problem in designing artiﬁcial neural networks for visual object recognition tasks is the proper choice of the network architecture. Evolutionary optimization met...

Georg Schneider, Heiko Wersing, Bernhard Sendhoff,...

claim paper

Read More »

« Prev « First page 42 / 677 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers