Sciweavers

437 search results - page 55 / 88
» Policy Gradient Critics
PDCAT
2004
Springer
14 years 2 months ago
RT-Grid: A QoS Oriented Service Grid Framework
Effective and efficient Quality of Service (QoS) management is critical for a service grid to meet the requirements of both grid users and service providers. We incorporate QoS man...
Hai Jin, Hanhua Chen, Minghu Zhang, Deqing Zou
CDC
2009
IEEE
14 years 1 months ago
Arbitrarily modulated Markov decision processes
We consider decision-making problems in Markov decision processes where both the rewards and the transition probabilities vary in an arbitrary (e.g., nonstationary) fashion. We...
Jia Yuan Yu, Shie Mannor
WACC
1999
ACM
14 years 1 months ago
Temporal workflow management in a claim handling system
Temporal workflow management is important for processes that are time-driven. Claim handling, which requires the documentation, diagnosis, and resolution of customer claims due to...
J. Leon Zhao, Edward A. Stohr
VLDB
1989
ACM
14 years 28 days ago
Priority in DBMS Resource Scheduling
In this paper, we address the problem of priority scheduling in a database management system. We start by investigating the architectural consequences of adding priority to a DBMS....
Michael J. Carey, Rajiv Jauhari, Miron Livny
ESANN
2008
13 years 10 months ago
Safe exploration for reinforcement learning
In this paper we define and address the problem of safe exploration in the context of reinforcement learning. Our notion of safety is concerned with states or transitions that can ...
Alexander Hans, Daniel Schneegaß, Anton Maxi...