Search Sciweavers | Sciweavers

49 search results - page 6 / 10

» An Iterative Algorithm for Solving Constrained Decentralized...

click to vote

CORR
2010
Springer

105views Education» more CORR 2010»

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

13 years 6 months ago

Download hal.archives-ouvertes.fr

We consider model-based reinforcement learning in ﬁnite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...

claim paper

Read More »

click to vote

SIGMETRICS
2000
ACM

105views Hardware» more SIGMETRICS 2000»

Using the exact state space of a Markov model to compute approximate stationary measures

13 years 11 months ago

Download www.cs.ucr.edu

We present a new approximation algorithm based on an exact representation of the state space S, using decision diagrams, and of the transition rate matrix R, using Kronecker algeb...

Andrew S. Miner, Gianfranco Ciardo, Susanna Donate...

claim paper

Read More »

click to vote

AAAI
2006

134views Intelligent Agents» more AAAI 2006»

Point-based Dynamic Programming for DEC-POMDPs

13 years 8 months ago

Download hal.archives-ouvertes.fr

We introduce point-based dynamic programming (DP) for decentralized partially observable Markov decision processes (DEC-POMDPs), a new discrete DP algorithm for planning strategie...

Daniel Szer, François Charpillet

claim paper

Read More »

click to vote

AAAI
2004

167views Intelligent Agents» more AAAI 2004»

Dynamic Programming for Partially Observable Stochastic Games

13 years 8 months ago

Download anytime.cs.umass.edu

We develop an exact dynamic programming algorithm for partially observable stochastic games (POSGs). The algorithm is a synthesis of dynamic programming for partially observable M...

Eric A. Hansen, Daniel S. Bernstein, Shlomo Zilber...

claim paper

Read More »

click to vote

MANSCI
2007

139views more MANSCI 2007»

A Market-Based Optimization Algorithm for Distributed Systems

13 years 7 months ago

Download userpages.umbc.edu

In this paper, a market-based decomposition method for decomposable linear systems is developed. The solution process iterates between a master problem that solves the market-matc...

Zhiling Guo, Gary J. Koehler, Andrew B. Whinston

claim paper

Read More »

« Prev « First page 6 / 10 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers