Sciweavers

632 search results - page 77 / 127
» Updating with incomplete observations
Sort
View
ECML
2007
Springer
14 years 3 months ago
Policy Gradient Critics
We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...
Daan Wierstra, Jürgen Schmidhuber
LFCS
2007
Springer
14 years 3 months ago
Model Checking Knowledge and Linear Time: PSPACE Cases
We present a general algorithm scheme for model checking logics of knowledge, common knowledge and linear time, based on simulations to a class of structures that capture the way t...
Kai Engelhardt, Peter Gammie, Ron van der Meyden
VLDB
2007
ACM
116views Database» more  VLDB 2007»
14 years 2 months ago
K-Anonymization as Spatial Indexing: Toward Scalable and Incremental Anonymization
In this paper we observe that k-anonymizing a data set is strikingly similar to building a spatial index over the data set, so similar in fact that classical spatial indexing tech...
Tochukwu Iwuchukwu, Jeffrey F. Naughton
ATAL
2005
Springer
14 years 2 months ago
Using decision-theoretic models to enhance agent system survivability
A survivable agent system depends on the incorporation of many recovery features. However, the optimal use of these features requires the ability to assess the actual state of the...
Anthony R. Cassandra, Marian H. Nodine, Shilpa Bon...
HICSS
2003
IEEE
123views Biometrics» more  HICSS 2003»
14 years 2 months ago
Issues in Rational Planning in Multi-Agent Settings
We adopt the decision-theoretic principle of expected utility maximization as a paradigm for designing autonomous rational agents operating in multi-agent environments. We use the...
Piotr J. Gmytrasiewicz