Search Sciweavers | Sciweavers


» Regret Bounds for Prediction Problems


INFOCOM
2010
IEEE


Opportunistic Spectrum Access with Multiple Users: Learning under Competition

15 years 6 months ago


Abstract—The problem of cooperative allocation among multiple secondary users to maximize cognitive system throughput is considered. The channel availability statistics are initi...

Animashree Anandkumar, Nithin Michael, Ao Tang


CORR
2010
Springer


Adaptive Bound Optimization for Online Convex Optimization

15 years 7 months ago


We introduce a new online convex optimization algorithm that adaptively chooses its regularization function based on the loss functions observed so far. This is in contrast to pre...

H. Brendan McMahan, Matthew J. Streeter


NIPS
2007


The Price of Bandit Information for Online Optimization

15 years 9 months ago


In the online linear optimization problem, a learner must choose, in each round, a decision from a set D ⊂ R^n in order to minimize an (unknown and changing) linear cost function...

Varsha Dani, Thomas P. Hayes, Sham Kakade


CORR
2010
Springer


Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

15 years 6 months ago


We consider model-based reinforcement learning in finite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...


IOR
2011


On the Minimax Complexity of Pricing in a Changing Environment

15 years 2 months ago


We consider a pricing problem in an environment where the customers’ willingness-to-pay (WtP) distribution may change at some point over the selling horizon. Customers arrive se...

Omar Besbes, Assaf J. Zeevi
