Sciweavers

COLT
2008
Springer
13 years 9 months ago
Competing in the Dark: An Efficient Algorithm for Bandit Linear Optimization
We introduce an efficient algorithm for the problem of online linear optimization in the bandit setting which achieves the optimal O ( T) regret. The setting is a natural general...
Jacob Abernethy, Elad Hazan, Alexander Rakhlin
COLT
2008
Springer
13 years 9 months ago
Minimizing Wide Range Regret with Time Selection Functions
Subhash Khot, Ashok Kumar Ponnuswami
COLT
2008
Springer
13 years 9 months ago
Stochastic Linear Optimization under Bandit Feedback
Varsha Dani, Thomas P. Hayes, Sham M. Kakade
COLT
2008
Springer
13 years 9 months ago
Density Estimation in Linear Time
Satyaki Mahalanabis, Daniel Stefankovic
COLT
2008
Springer
13 years 9 months ago
More Efficient Internal-Regret-Minimizing Algorithms
Standard no-internal-regret (NIR) algorithms compute a fixed point of a matrix, and hence typically require O(n3 ) run time per round of learning, where n is the dimensionality of...
Amy R. Greenwald, Zheng Li, Warren Schudy
COLT
2008
Springer
13 years 9 months ago
A Query Algorithm for Agnostically Learning DNF?
Parikshit Gopalan, Adam Kalai, Adam R. Klivans
COLT
2008
Springer
13 years 9 months ago
An Information Theoretic Framework for Multi-view Learning
In the multi-view learning paradigm, the input variable is partitioned into two different views X1 and X2 and there is a target variable Y of interest. The underlying assumption i...
Karthik Sridharan, Sham M. Kakade
COLT
2008
Springer
13 years 9 months ago
Dimension and Margin Bounds for Reflection-invariant Kernels
A kernel over the Boolean domain is said to be reflection-invariant, if its value does not change when we flip the same bit in both arguments. (Many popular kernels have this prop...
Thorsten Doliwa, Michael Kallweit, Hans-Ulrich Sim...
COLT
2008
Springer
13 years 9 months ago
Learning Rotations
An algorithm is presented for online learning of rotations. The proposed algorithm involves matrix exponentiated gradient updates and is motivated by the von Neumann divergence. T...
Adam M. Smith, Manfred K. Warmuth
COLT
2008
Springer
13 years 9 months ago
Adaptive Aggregation for Reinforcement Learning with Efficient Exploration: Deterministic Domains
We propose a model-based learning algorithm, the Adaptive Aggregation Algorithm (AAA), that aims to solve the online, continuous state space reinforcement learning problem in a de...
Andrey Bernstein, Nahum Shimkin