Online Learning with Queries

14 years 9 months ago

Download siam.org

The online learning problem requires a player to iteratively choose an action in an unknown and changing environment. In the standard setting of this problem, the player has to choose an action in each round before knowing anything about the corresponding loss. However, there are situations in which it seems possible for the player to spend efforts or resources to collect some prior information before her actions. This motivates us to study a variant of the online learning problem, in which the player is allowed to query B bits from the loss vector in each round before choosing her action. Suppose each loss value is represented by K bits and distinct loss values differ by at least some amount , and suppose there are N actions to choose and T rounds to play. We provide an algorithm for this problem which achieves a regret of the following form. Before B approaching B1 = NK/2, the regret stays at O( T ln N), and after B exceeding B1 but before approaching B2 = NK/2 + 3K/2-1, the regret...

Chao-Kai Chiang, Chi-Jen Lu

Real-time Traffic

Discrete Algorithms | Online Learning Problem | Regret Lower Bounds | Regret Upper Bounds | SODA 2010 |

claim paper

Post Info
More Details (n/a)

Added	01 Mar 2010
Updated	02 Mar 2010
Type	Conference
Year	2010
Where	SODA
Authors	Chao-Kai Chiang, Chi-Jen Lu

Comments (0)

Sciweavers

Online Learning with Queries

Discrete Algorithms | Online Learning Problem | Regret Lower Bounds | Regret Upper Bounds | SODA 2010 |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers