Sciweavers

683 search results - page 112 / 137
» Coarticulation in Markov Decision Processes
Sort
View
INFOCOM
2009
IEEE
14 years 3 months ago
Network Bandwidth Allocation via Distributed Auctions with Time Reservations
—This paper studies the problem of allocating network capacity through periodic auctions. Motivated primarily by a service overlay architecture, we impose the following condition...
Pablo Belzarena, Andrés Ferragut, Fernando ...
ATAL
2009
Springer
14 years 3 months ago
SarsaLandmark: an algorithm for learning in POMDPs with landmarks
Reinforcement learning algorithms that use eligibility traces, such as Sarsa(λ), have been empirically shown to be effective in learning good estimated-state-based policies in pa...
Michael R. James, Satinder P. Singh
CDC
2008
IEEE
117views Control Systems» more  CDC 2008»
14 years 3 months ago
Event-based optimization for dispatching policies in material handling systems of general assembly lines
—A material handling (MH) system of a general assembly line dispatching parts from inventory to working buffers could be complicated and costly to operate. Generally it is extrem...
Yanjia Zhao, Qianchuan Zhao, Qing-Shan Jia, Xiaoho...
CDC
2008
IEEE
197views Control Systems» more  CDC 2008»
14 years 3 months ago
Dynamic spectrum access policies for cognitive radio
—We study the problem of dynamic spectrum sensing and access in cognitive radio systems as a partially observed Markov decision process (POMDP). A group of cognitive users cooper...
Jayakrishnan Unnikrishnan, Venugopal V. Veeravalli
CDC
2008
IEEE
204views Control Systems» more  CDC 2008»
14 years 3 months ago
Dynamic ping optimization for surveillance in multistatic sonar buoy networks with energy constraints
— In this paper we study the problem of dynamic optimization of ping schedule in an active sonar buoy network deployed to provide persistent surveillance of a littoral area throu...
Anshu Saksena, I-Jeng Wang