ALT 2007 | Sciweavers

ALT
2007
Springer

Machine Learning

Tuning Bandit Algorithms in Stochastic Environments

16 years 3 months ago

Algorithms based on upper-confidence bounds for balancing exploration and exploitation are gaining popularity since they are easy to implement, efficient and effective. In this p...

Jean-Yves Audibert, Rémi Munos, Csaba Szepe...

ALT
2007
Springer

Machine Learning

Prescribed Learning of R.E. Classes

16 years 3 months ago

Download www.comp.nus.edu.sg

Abstract. This work extends studies of Angluin, Lange and Zeugmann on the dependence of learning on the hypothesis space chosen for the class. In subsequent investigations, uniform...

Sanjay Jain, Frank Stephan, Nan Ye

ALT
2007
Springer

Machine Learning

Pseudometrics for State Aggregation in Average Reward Markov Decision Processes

16 years 3 months ago

Download personal.unileoben.ac.at

We consider how state similarity in average reward Markov decision processes (MDPs) may be described by pseudometrics. Introducing the notion of adequate pseudometrics which are we...

Ronald Ortner
