Sciweavers

1669 search results - page 50 / 334
» SMO-Style Algorithms for Learning Using Privileged Informati...
Sort
View
WWW
2011
ACM
13 years 2 months ago
Learning to re-rank: query-dependent image re-ranking using click data
Our objective is to improve the performance of keyword based image search engines by re-ranking their baseline results. To this end, we address three limitations of existing searc...
Vidit Jain, Manik Varma
CORR
2011
Springer
209views Education» more  CORR 2011»
12 years 11 months ago
Close the Gaps: A Learning-while-Doing Algorithm for a Class of Single-Product Revenue Management Problems
In this work, we consider a retailer selling a single product with limited on-hand inventory over a finite selling season. Customer demand arrives according to a Poisson process,...
Zizhuo Wang, Shiming Deng, Yinyu Ye
COOPIS
2000
IEEE
14 years 4 days ago
Dynamic Pricing with Limited Competitor Information in a Multi-Agent Economy
We study the price dynamics in a multi-agent economy consisting of buyers and competing sellers, where each seller has limited information about its competitors’ prices. In this ...
Prithviraj Dasgupta, Rajarshi Das
COLT
2010
Springer
13 years 5 months ago
Optimal Algorithms for Online Convex Optimization with Multi-Point Bandit Feedback
Bandit convex optimization is a special case of online convex optimization with partial information. In this setting, a player attempts to minimize a sequence of adversarially gen...
Alekh Agarwal, Ofer Dekel, Lin Xiao
AAMAS
2007
Springer
13 years 7 months ago
Parallel Reinforcement Learning with Linear Function Approximation
In this paper, we investigate the use of parallelization in reinforcement learning (RL), with the goal of learning optimal policies for single-agent RL problems more quickly by us...
Matthew Grounds, Daniel Kudenko