dilemma | Sciweavers

PPOPP
2011
ACM

178views Distributed and Parallel Com...» more PPOPP 2011»

13 years 5 months ago

Time skewing and loop tiling has been known for a long time to be a highly beneﬁcial acceleration technique for nested loops especially on bandwidth hungry multi-core processors...

Robert Strzodka, Mohammed Shaheen, Dawid Pajak

claim paper

Read More »

click to vote

ML
2002
ACM

133views Machine Learning» more ML 2002»

Finite-time Analysis of the Multiarmed Bandit Problem

14 years 2 months ago

Download homes.dsi.unimi.it

Reinforcement learning policies face the exploration versus exploitation dilemma, i.e. the search for a balance between exploring the environment to find profitable actions while t...

Peter Auer, Nicolò Cesa-Bianchi, Paul Fisch...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers