Search Sciweavers | Sciweavers

802 search results - page 63 / 161

» Experts in a Markov Decision Process

113

click to vote

NIPS
2004

128views Information Technology» more NIPS 2004»

A Cost-Shaping LP for Bellman Error Minimization with Performance Guarantees

15 years 7 months ago

Download books.nips.cc

We introduce a new algorithm based on linear programming that approximates the differential value function of an average-cost Markov decision process via a linear combination of p...

Daniela Pucci de Farias, Benjamin Van Roy

claim paper

Read More »

134

click to vote

AAAI
1994

159views Intelligent Agents» more AAAI 1994»

Acting Optimally in Partially Observable Stochastic Domains

15 years 6 months ago

Download www.cs.rutgers.edu

In this paper, we describe the partially observable Markov decision process pomdp approach to nding optimal or near-optimal control strategies for partially observable stochastic ...

Anthony R. Cassandra, Leslie Pack Kaelbling, Micha...

claim paper

Read More »

199

click to vote

ICRA
2010
IEEE

101views Robotics» more ICRA 2010»

Multirobot coordination by auctioning POMDPs

15 years 4 months ago

Download users.isr.ist.utl.pt

— We consider the problem of task assignment and execution in multirobot systems, by proposing a procedure for bid estimation in auction protocols. Auctions are of interest to mu...

Matthijs T. J. Spaan, Nelson Gonçalves, Jo&...

claim paper

Read More »

147

click to vote

EUROPKI
2004
Springer

81views Security Privacy» more EUROPKI 2004»

A Probabilistic Model for Evaluating the Operational Cost of PKI-based Financial Transactions

15 years 11 months ago

Download security.ncsa.illinois.edu

The use of PKI in large scale environments suffers some inherent problems concerning the options to adopt for the optimal cost-centered operation of the system. In this paper a Mar...

Agapios N. Platis, Costas Lambrinoudakis, Assimaki...

claim paper

Read More »

159

click to vote

ATAL
2009
Springer

146views Intelligent Agents» more ATAL 2009»

Online exploration in least-squares policy iteration

16 years 3 days ago

Download www.aamas-conference.org

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

claim paper

Read More »

« Prev « First page 63 / 161 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers