Search Sciweavers | Sciweavers

771 search results - page 74 / 155

» Markov Decision Processes with Arbitrary Reward Processes

132

click to vote

JMLR
2006

143views more JMLR 2006»

Geometric Variance Reduction in Markov Chains: Application to Value Function and Gradient Estimation

15 years 4 months ago

Download www.aaai.org

We study a sequential variance reduction technique for Monte Carlo estimation of functionals in Markov Chains. The method is based on designing sequential control variates using s...

Rémi Munos

claim paper

Read More »

133

click to vote

EUROPKI
2004
Springer

81views Security Privacy» more EUROPKI 2004»

A Probabilistic Model for Evaluating the Operational Cost of PKI-based Financial Transactions

15 years 9 months ago

Download security.ncsa.illinois.edu

The use of PKI in large scale environments suffers some inherent problems concerning the options to adopt for the optimal cost-centered operation of the system. In this paper a Mar...

Agapios N. Platis, Costas Lambrinoudakis, Assimaki...

claim paper

Read More »

144

click to vote

ATAL
2009
Springer

146views Intelligent Agents» more ATAL 2009»

Online exploration in least-squares policy iteration

15 years 10 months ago

Download www.aamas-conference.org

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

claim paper

Read More »

134

click to vote

STACS
1997
Springer

137views Theoretical Computer Science» more STACS 1997»

Methods and Applications of (MAX, +) Linear Algebra

15 years 8 months ago

Download www-rocq.inria.fr

Exotic semirings such as the “(max, +) semiring” (R ∪ {−∞}, max, +), or the “tropical semiring” (N ∪ {+∞}, min, +), have been invented and reinvented many times s...

Stephane Gaubert, Max Plus

claim paper

Read More »

127

click to vote

AI
2006
Springer

110views Artificial Intelligence» more AI 2006»

An Efficient Resource Allocation Approach in Real-Time Stochastic Environment

15 years 8 months ago

Download www.damas.ift.ulaval.ca

We are interested in contributing to solving effectively a particular type of real-time stochastic resource allocation problem. Firstly, one distinction is that certain tasks may c...

Pierrick Plamondon, Brahim Chaib-draa, Abder Rezak...

claim paper

Read More »

« Prev « First page 74 / 155 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers