Sciweavers

983 search results - page 27 / 197
» A Better Update Policy
Sort
View
POLICY
2005
Springer
14 years 2 months ago
Using Linear Temporal Model Checking for Goal-Oriented Policy Refinement Frameworks
Policy refinement is meant to derive lower-level policies from higher-level ones so that these more specific policies are better suited for use in different execution environments...
Javier Rubio-Loyola, Joan Serrat, Marinos Charalam...
IJSYSC
2008
82views more  IJSYSC 2008»
13 years 8 months ago
Supply-chain modelling and control under proportional inventory-replenishment policies
A novel state-space model of a multi-node supply chain is presented, controlled via local proportional inventory-replenishment policies. The model is driven by a stochastic sequen...
C. I. Papanagnou, G. D. Halikias
ECML
2005
Springer
14 years 2 months ago
Natural Actor-Critic
This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...
Jan Peters, Sethu Vijayakumar, Stefan Schaal
PRIMA
2007
Springer
14 years 2 months ago
Multiagent Planning with Trembling-Hand Perfect Equilibrium in Multiagent POMDPs
Multiagent Partially Observable Markov Decision Processes are a popular model of multiagent systems with uncertainty. Since the computational cost for finding an optimal joint pol...
Yuichi Yabu, Makoto Yokoo, Atsushi Iwasaki
PLDI
2005
ACM
14 years 2 months ago
Composing security policies with polymer
We introduce a language and system that supports definition and composition of complex run-time security policies for Java applications. Our policies are comprised of two sorts o...
Lujo Bauer, Jay Ligatti, David Walker