Search Sciweavers | Sciweavers

1669 search results - page 50 / 334

» SMO-Style Algorithms for Learning Using Privileged Informati...

253

click to vote

WWW
2011
ACM

297views Internet Technology» more WWW 2011»

Learning to re-rank: query-dependent image re-ranking using click data

15 years 2 months ago

Download vis-www.cs.umass.edu

Our objective is to improve the performance of keyword based image search engines by re-ranking their baseline results. To this end, we address three limitations of existing searc...

Vidit Jain, Manik Varma

claim paper

Read More »

204

click to vote

CORR
2011
Springer

209views Education» more CORR 2011»

Close the Gaps: A Learning-while-Doing Algorithm for a Class of Single-Product Revenue Management Problems

14 years 11 months ago

Download www.stanford.edu

In this work, we consider a retailer selling a single product with limited on-hand inventory over a ﬁnite selling season. Customer demand arrives according to a Poisson process,...

Zizhuo Wang, Shiming Deng, Yinyu Ye

claim paper

Read More »

180

click to vote

COOPIS
2000
IEEE

139views Information Technology» more COOPIS 2000»

Dynamic Pricing with Limited Competitor Information in a Multi-Agent Economy

15 years 11 months ago

Download faculty.ist.unomaha.edu

We study the price dynamics in a multi-agent economy consisting of buyers and competing sellers, where each seller has limited information about its competitors’ prices. In this ...

Prithviraj Dasgupta, Rajarshi Das

claim paper

Read More »

239

click to vote

COLT
2010
Springer

217views Machine Learning» more COLT 2010»

Optimal Algorithms for Online Convex Optimization with Multi-Point Bandit Feedback

15 years 5 months ago

Download www.eecs.berkeley.edu

Bandit convex optimization is a special case of online convex optimization with partial information. In this setting, a player attempts to minimize a sequence of adversarially gen...

Alekh Agarwal, Ofer Dekel, Lin Xiao

claim paper

Read More »

162

click to vote

AAMAS
2007
Springer

142views Intelligent Agents» more AAMAS 2007»

Parallel Reinforcement Learning with Linear Function Approximation

15 years 7 months ago

Download www.aamas-conference.org

In this paper, we investigate the use of parallelization in reinforcement learning (RL), with the goal of learning optimal policies for single-agent RL problems more quickly by us...

Matthew Grounds, Daniel Kudenko

claim paper

Read More »

« Prev « First page 50 / 334 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers