Sciweavers

1974 search results - page 129 / 395
» Online learning in online auctions
Sort
View
COLT
2010
Springer
13 years 8 months ago
Optimal Algorithms for Online Convex Optimization with Multi-Point Bandit Feedback
Bandit convex optimization is a special case of online convex optimization with partial information. In this setting, a player attempts to minimize a sequence of adversarially gen...
Alekh Agarwal, Ofer Dekel, Lin Xiao
COMBINATORICA
2010
13 years 7 months ago
Online vertex-coloring games in random graphs
Consider the following one-player game. The vertices of a random graph on n vertices are revealed to the player one by one. In each step, also all edges connecting the newly reveal...
Martin Marciniszyn, Reto Spöhel
JMLR
2010
119views more  JMLR 2010»
13 years 5 months ago
A Convergent Online Single Time Scale Actor Critic Algorithm
Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...
Dotan Di Castro, Ron Meir
ICPR
2008
IEEE
14 years 4 months ago
Signature verification based on fusion of on-line and off-line kernels
The problem of signature verification is considered within the bounds of the kernel-based methodology of pattern recognition, more specifically, SVM principle of machine learning....
Vadim Mottl, Mikhail Lange, Valentina Sulimova, Al...
COLT
2003
Springer
14 years 3 months ago
Internal Regret in On-Line Portfolio Selection
This paper extends the game-theoretic notion of internal regret to the case of on-line potfolio selection problems. New sequential investment strategies are designed to minimize th...
Gilles Stoltz, Gábor Lugosi