Search Sciweavers | Sciweavers

332 search results - page 26 / 67

» Ranking policies in discrete Markov decision processes

click to vote

ATAL
2005
Springer

117views Intelligent Agents» more ATAL 2005»

Modeling task allocation using a decision theoretic model

14 years 1 months ago

Download dis.cs.umass.edu

Mediation is the process of decomposing a task into subtasks, ﬁnding agents suitable for these subtasks and negotiating with agents to obtain commitments to execute these subtas...

Sherief Abdallah, Victor R. Lesser

claim paper

Read More »

click to vote

JMLR
2006

124views more JMLR 2006»

Policy Gradient in Continuous Time

13 years 8 months ago

Download hal.inria.fr

Policy search is a method for approximately solving an optimal control problem by performing a parametric optimization search in a given class of parameterized policies. In order ...

Rémi Munos

claim paper

Read More »

click to vote

CSFW
2010
IEEE

187views Security Privacy» more CSFW 2010»

Towards Quantitative Analysis of Proofs of Authorization: Applications, Framework, and Techniques

13 years 11 months ago

Download www.cs.pitt.edu

—Although policy compliance testing is generally treated as a binary decision problem, the evidence gathered during the trust management process can actually be used to examine t...

Adam J. Lee, Ting Yu

claim paper

Read More »

click to vote

WWW
2005
ACM

146views Internet Technology» more WWW 2005»

PageRank as a function of the damping factor

14 years 8 months ago

Download www2005.org

PageRank is defined as the stationary state of a Markov chain. The chain is obtained by perturbing the transition matrix induced by a web graph with a damping factor that spreads...

Paolo Boldi, Massimo Santini, Sebastiano Vigna

claim paper

Read More »

click to vote

GECCO
2004
Springer

142views Optimization» more GECCO 2004»

Improving MACS Thanks to a Comparison with 2TBNs

14 years 1 months ago

Download www.cs.york.ac.uk

Abstract. Factored Markov Decision Processes is the theoretical framework underlying multi-step Learning Classiﬁer Systems research. This framework is mostly used in the context ...

Olivier Sigaud, Thierry Gourdin, Pierre-Henri Wuil...

claim paper

Read More »

« Prev « First page 26 / 67 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers