Sciweavers

Search results for "Regret Bounds for Gaussian Process Bandit Problems" (74 results, page 8 of 15)

Tree Exploration for Bayesian RL Exploration
Christos Dimitrakakis
CIMCA 2008, IEEE
Research in reinforcement learning has produced algorithms for optimal decision making under uncertainty that fall into two main types. The first employs a Bayesian framework, ...
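
The Bayes-optimal approach this abstract alludes to amounts to expanding a tree over posterior beliefs. As a hedged illustration of that general idea (not the paper's algorithm), the following minimal Python sketch performs depth-limited expectimax over Beta posteriors for a two-armed Bernoulli bandit; the two-arm setting and all names are assumptions made for the example.

    # Hedged sketch: depth-limited expansion of the Bayesian belief tree for a
    # two-armed Bernoulli bandit with Beta posteriors. Illustrative only.
    from functools import lru_cache

    @lru_cache(maxsize=None)
    def belief_value(beliefs, depth):
        """Expected total reward of acting Bayes-optimally for `depth` steps.
        `beliefs` is a tuple of (alpha, beta) Beta parameters, one per arm."""
        if depth == 0:
            return 0.0
        best = float("-inf")
        for i, (a, b) in enumerate(beliefs):
            p = a / (a + b)                              # posterior predictive P(reward = 1)
            win = list(beliefs); win[i] = (a + 1, b)     # belief after a success
            lose = list(beliefs); lose[i] = (a, b + 1)   # belief after a failure
            q = p * (1.0 + belief_value(tuple(win), depth - 1)) \
                + (1.0 - p) * belief_value(tuple(lose), depth - 1)
            best = max(best, q)
        return best

    # Usage: value of a uniform prior over both arms with a 3-step lookahead.
    print(belief_value(((1, 1), (1, 1)), 3))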

Experts in a Markov Decision Process
Eyal Even-Dar, Sham M. Kakade, Yishay Mansour
NIPS 2004
We consider an MDP setting in which the reward function is allowed to change during each time step of play (possibly in an adversarial manner), yet the dynamics remain fixed. Simi...
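
This line of work broadly reduces the changing-reward MDP to running a regret-minimizing experts algorithm (associated with each state). As a hedged sketch of that building block only, not the paper's full reduction, here is a minimal exponential-weights (Hedge) loop, assuming losses in [0, 1]; the function name and learning rate are illustrative.

    import numpy as np

    def hedge(loss_matrix, eta=0.1):
        """Exponential weights over N experts.
        loss_matrix[t, i] is the loss of expert i at round t, assumed in [0, 1]."""
        T, N = loss_matrix.shape
        w = np.ones(N)                     # unnormalized expert weights
        total_loss = 0.0
        for t in range(T):
            p = w / w.sum()                # play the normalized weight vector
            total_loss += p @ loss_matrix[t]
            w *= np.exp(-eta * loss_matrix[t])   # multiplicative update
        return total_loss

    # Usage: two experts, three rounds of made-up losses.
    print(hedge(np.array([[0.0, 1.0], [0.2, 0.9], [0.1, 0.8]])))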

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence
Sarah Filippi, Olivier Cappé, Aurelien Gari...
CoRR 2010, Springer
We consider model-based reinforcement learning in finite Markov Decision Processes (MDPs), focusing on so-called optimistic strategies. Optimism is usually implemented by carryin...
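
Optimistic strategies of this kind define the confidence region by a Kullback-Leibler ball rather than the usual L1 or Hoeffding-style bounds. As a hedged, minimal illustration of the KL-based upper confidence computation (for a single Bernoulli mean, not the paper's full MDP algorithm), one can invert the KL constraint by bisection:

    import math

    def bern_kl(p, q):
        """KL divergence between Bernoulli(p) and Bernoulli(q)."""
        eps = 1e-12
        p = min(max(p, eps), 1 - eps)
        q = min(max(q, eps), 1 - eps)
        return p * math.log(p / q) + (1 - p) * math.log((1 - p) / (1 - q))

    def kl_upper_bound(p_hat, budget, tol=1e-6):
        """Largest q >= p_hat with KL(Bern(p_hat) || Bern(q)) <= budget.
        Bisection is valid because the KL is increasing in q on [p_hat, 1]."""
        lo, hi = p_hat, 1.0
        while hi - lo > tol:
            mid = (lo + hi) / 2
            if bern_kl(p_hat, mid) <= budget:
                lo = mid
            else:
                hi = mid
        return lo

    # Usage: optimistic estimate for an empirical mean of 0.4 with budget 0.05.
    print(kl_upper_bound(0.4, 0.05))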

Bayesian Gaussian Process Latent Variable Model
Michalis Titsias, Neil D. Lawrence
JMLR 2010
We introduce a variational inference framework for training the Gaussian process latent variable model and thus performing Bayesian nonlinear dimensionality reduction. This method...
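
For context, the non-Bayesian GP-LVM that this work builds on fits latent inputs X by optimizing the GP marginal likelihood of the observed outputs Y; the variational framework described above instead integrates X out. Below is a minimal sketch of that underlying objective only, assuming an RBF kernel and isotropic noise (all names and defaults are illustrative, not the paper's method):

    import numpy as np

    def gplvm_nll(X, Y, lengthscale=1.0, noise=0.1):
        """Negative log marginal likelihood of a GP-LVM: each column of Y
        is modeled as an independent GP over the latent inputs X (n x q)."""
        sq = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)      # pairwise sq. distances
        K = np.exp(-0.5 * sq / lengthscale**2) + noise**2 * np.eye(len(X))
        _, logdet = np.linalg.slogdet(K)
        alpha = np.linalg.solve(K, Y)                            # K^{-1} Y
        n, d = Y.shape
        # Sum of the d independent GP log likelihoods, negated.
        return 0.5 * (d * logdet + (Y * alpha).sum() + n * d * np.log(2 * np.pi))

    # Usage: evaluate the objective for random latents and data.
    rng = np.random.default_rng(0)
    print(gplvm_nll(rng.normal(size=(20, 2)), rng.normal(size=(20, 5))))

Training would minimize this quantity over X (e.g. with gradient descent); the Bayesian treatment in the paper replaces that point estimate with a variational posterior over X.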

Gaussian averages of interpolated bodies and applications to approximate reconstruction
Y. Gordon, A. E. Litvak, Shahar Mendelson, A. Pajo...
JAT 2007
We prove sharp bounds for the expectation of the supremum of the Gaussian process indexed by the intersection of $B_p^n$ with $\rho B_q^n$ for $1 \le p, q \le \infty$ and $\rho > 0$, and by the ...
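
For reference, the Gaussian average (mean width) of a body $T \subset \mathbb{R}^n$ appearing in this abstract is the standard quantity

    \ell(T) \;=\; \mathbb{E}\,\sup_{t \in T}\,\langle g, t \rangle,
    \qquad g \sim \mathcal{N}(0, I_n),

here with $T = B_p^n \cap \rho B_q^n$; this notation follows common convex-geometry usage rather than the paper's exact statement.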