Search Sciweavers | Sciweavers

632 search results - page 77 / 127

» Updating with incomplete observations

click to vote

ECML
2007
Springer

192views Machine Learning» more ECML 2007»

Policy Gradient Critics

14 years 3 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

claim paper

Read More »

click to vote

LFCS
2007
Springer

115views Artificial Intelligence» more LFCS 2007»

Model Checking Knowledge and Linear Time: PSPACE Cases

14 years 3 months ago

Download www.cse.unsw.edu.au

We present a general algorithm scheme for model checking logics of knowledge, common knowledge and linear time, based on simulations to a class of structures that capture the way t...

Kai Engelhardt, Peter Gammie, Ron van der Meyden

claim paper

Read More »

click to vote

VLDB
2007
ACM

116views Database» more VLDB 2007»

K-Anonymization as Spatial Indexing: Toward Scalable and Incremental Anonymization

14 years 2 months ago

Download pages.cs.wisc.edu

In this paper we observe that k-anonymizing a data set is strikingly similar to building a spatial index over the data set, so similar in fact that classical spatial indexing tech...

Tochukwu Iwuchukwu, Jeffrey F. Naughton

claim paper

Read More »

click to vote

ATAL
2005
Springer

98views Intelligent Agents» more ATAL 2005»

Using decision-theoretic models to enhance agent system survivability

14 years 2 months ago

Download www.cs.huji.ac.il

A survivable agent system depends on the incorporation of many recovery features. However, the optimal use of these features requires the ability to assess the actual state of the...

Anthony R. Cassandra, Marian H. Nodine, Shilpa Bon...

claim paper

Read More »

click to vote

HICSS
2003
IEEE

123views Biometrics» more HICSS 2003»

Issues in Rational Planning in Multi-Agent Settings

14 years 2 months ago

Download www.hicss.hawaii.edu

We adopt the decision-theoretic principle of expected utility maximization as a paradigm for designing autonomous rational agents operating in multi-agent environments. We use the...

Piotr J. Gmytrasiewicz

claim paper

Read More »

« Prev « First page 77 / 127 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers