Sciweavers

983 search results - page 27 / 197
» A Better Update Policy
Sort
View
POLICY
2005
Springer
15 years 9 months ago
Using Linear Temporal Model Checking for Goal-Oriented Policy Refinement Frameworks
Policy refinement is meant to derive lower-level policies from higher-level ones so that these more specific policies are better suited for use in different execution environments...
Javier Rubio-Loyola, Joan Serrat, Marinos Charalam...
109
Voted
IJSYSC
2008
82views more  IJSYSC 2008»
15 years 3 months ago
Supply-chain modelling and control under proportional inventory-replenishment policies
A novel state-space model of a multi-node supply chain is presented, controlled via local proportional inventory-replenishment policies. The model is driven by a stochastic sequen...
C. I. Papanagnou, G. D. Halikias
155
Voted
ECML
2005
Springer
15 years 9 months ago
Natural Actor-Critic
This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...
Jan Peters, Sethu Vijayakumar, Stefan Schaal
107
Voted
PRIMA
2007
Springer
15 years 10 months ago
Multiagent Planning with Trembling-Hand Perfect Equilibrium in Multiagent POMDPs
Multiagent Partially Observable Markov Decision Processes are a popular model of multiagent systems with uncertainty. Since the computational cost for finding an optimal joint pol...
Yuichi Yabu, Makoto Yokoo, Atsushi Iwasaki
PLDI
2005
ACM
15 years 9 months ago
Composing security policies with polymer
We introduce a language and system that supports definition and composition of complex run-time security policies for Java applications. Our policies are comprised of two sorts o...
Lujo Bauer, Jay Ligatti, David Walker