The Value-of-Information in Matching with Queues

10 years 2 months ago


We consider the problem of optimal matching with queues in dynamic systems and investigate the value-of-information. In such systems, operators match tasks and resources stored in queues, with the objective of maximizing the system utility of the matching reward profile minus the average matching cost. This problem appears in many practical systems. The main challenges are the no-underflow constraints and the lack of both matching-reward information and system dynamics statistics. We develop two online matching algorithms, Learning-aided Reward optimAl Matching (LRAM) and Dual-LRAM (DRAM), to effectively resolve both challenges. Both algorithms are equipped with a learning module for estimating the matching-reward information, while DRAM incorporates an additional module for learning the system dynamics. We show that both algorithms achieve an O(ε + δ_r) close-to-optimal utility performance for any ε > 0, while DRAM achieves a faster convergence speed and a better delay compared ...
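To make the setting concrete, the toy simulation below illustrates a no-underflow constraint and a simple reward-learning step. It is not LRAM or DRAM: it swaps in a plain greedy rule driven by sample-mean reward estimates, and every name and number in it (the two queues, the two matching options, the arrival probabilities, the reward means) is a hypothetical assumption chosen for illustration only.

```python
import random


def feasible(queues, need):
    """No-underflow check: a match may only consume items that are queued."""
    return all(q >= n for q, n in zip(queues, need))


def simulate(steps=2000, seed=0):
    rng = random.Random(seed)
    # Hypothetical setup (not from the paper): 2 queues, 2 matching options.
    needs = [(1, 1), (2, 0)]      # items each option consumes per match
    true_mean = [1.0, 0.6]        # mean rewards, unknown to the operator
    arrival_prob = (0.6, 0.4)     # Bernoulli arrival probability per queue

    queues = [0, 0]
    reward_sum = [0.0, 0.0]
    count = [0, 0]
    total_reward = 0.0

    for _ in range(steps):
        # Random arrivals to each queue.
        for i, p in enumerate(arrival_prob):
            if rng.random() < p:
                queues[i] += 1

        # Keep only the options that satisfy the no-underflow constraint.
        options = [k for k, need in enumerate(needs) if feasible(queues, need)]
        if not options:
            continue

        # Greedy rule: try untested options first, then the best sample mean.
        def score(k):
            return float("inf") if count[k] == 0 else reward_sum[k] / count[k]

        k = max(options, key=score)

        # Perform the match and observe a noisy reward.
        queues = [q - n for q, n in zip(queues, needs[k])]
        reward = rng.gauss(true_mean[k], 0.1)
        reward_sum[k] += reward
        count[k] += 1
        total_reward += reward

    estimates = [
        reward_sum[k] / count[k] if count[k] else None
        for k in range(len(needs))
    ]
    return queues, estimates, total_reward


if __name__ == "__main__":
    final_queues, estimates, total = simulate()
    print("final queue lengths:", final_queues)
    print("estimated mean rewards:", estimates)
    print("total reward:", round(total, 2))
```

The feasibility filter is what enforces no-underflow here: an option is considered only when every queue it draws from holds enough items. The sample-mean estimates stand in, very loosely, for the kind of reward learning the abstract describes.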

Longbo Huang

Real-time Traffic