Search Sciweavers | Sciweavers

829 search results - page 2 / 166

» Bandit Algorithms for Tree Search

click to vote

ICML
2009
IEEE

170views Machine Learning» more ICML 2009»

Interactively optimizing information retrieval systems as a dueling bandits problem

14 years 8 months ago

Download www.yisongyue.com

We present an on-line learning framework tailored towards real-time learning from observed user behavior in search engines and other information retrieval systems. In particular, ...

Yisong Yue, Thorsten Joachims

claim paper

Read More »

click to vote

ACG
2009
Springer

269views Computer Graphics» more ACG 2009»

Adding Expert Knowledge and Exploration in Monte-Carlo Tree Search

14 years 2 months ago

Download www.personeel.unimaas.nl

Abstract. We present a new exploration term, more eﬃcient than classical UCT-like exploration terms. It combines eﬃciently expert rules, patterns extracted from datasets, All-M...

Guillaume Chaslot, Christophe Fiter, Jean-Baptiste...

claim paper

Read More »

click to vote

CORR
2010
Springer

189views Education» more CORR 2010»

An Optimal Dynamic Mechanism for Multi-Armed Bandit Processes

13 years 7 months ago

Download research.microsoft.com

We consider the problem of revenue-optimal dynamic mechanism design in settings where agents' types evolve over time as a function of their (both public and private) experien...

Sham M. Kakade, Ilan Lobel, Hamid Nazerzadeh

claim paper

Read More »

click to vote

IANDC
2008

101views more IANDC 2008»

Trees with exponentially growing costs

13 years 7 months ago

Download www.cs.ust.hk

We investigate code trees and search trees with cost functions that increase exponentially with the depth in the tree. While corresponding coding theorems have been considered in ...

Frank Schulz

claim paper

Read More »

click to vote

CIMCA
2008
IEEE

125views Intelligent Agents» more CIMCA 2008»

Tree Exploration for Bayesian RL Exploration

14 years 1 months ago

Download arxiv.org

Research in reinforcement learning has produced algorithms for optimal decision making under uncertainty that fall within two main types. The ﬁrst employs a Bayesian framework, ...

Christos Dimitrakakis

posted by olethros

Read More »

« Prev « First page 2 / 166 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers