Search Sciweavers | Sciweavers

CIMCA
2008
IEEE

Tree Exploration for Bayesian RL Exploration

16 years 1 months ago

Research in reinforcement learning has produced algorithms for optimal decision making under uncertainty that fall within two main types. The first employs a Bayesian framework, ...

Christos Dimitrakakis

COLT
2003
Springer

Lower Bounds on the Sample Complexity of Exploration in the Multi-armed Bandit Problem

15 years 12 months ago

Download www.ece.mcgill.ca

We consider the Multi-armed bandit problem under the PAC (“probably approximately correct”) model. It was shown by Even-Dar et al. [5] that given n arms, it suffices to play th...

Shie Mannor, John N. Tsitsiklis

AAAI
2006

A Characterization of Interventional Distributions in Semi-Markovian Causal Models

15 years 8 months ago

Download ftp.cs.ucla.edu

We offer a complete characterization of the set of distributions that could be induced by local interventions on variables governed by a causal Bayesian network of unknown structu...

Jin Tian, Changsung Kang, Judea Pearl

JAIR
2010

Active Tuples-based Scheme for Bounding Posterior Beliefs

15 years 5 months ago

Download www.ics.uci.edu

The paper presents a scheme for computing lower and upper bounds on the posterior marginals in Bayesian networks with discrete variables. Its power lies in its ability to use any ...

Bozhena Bidyuk, Rina Dechter, Emma Rollon

NIPS
2004

Variational Minimax Estimation of Discrete Distributions under KL Loss

15 years 8 months ago

Download books.nips.cc

We develop a family of upper and lower bounds on the worst-case expected KL loss for estimating a discrete distribution on a finite number m of points, given N i.i.d. samples. Our...

Liam Paninski
