Search Sciweavers | Sciweavers

47 search results - page 5 / 10

» Average-Reward Decentralized Markov Decision Processes

ICC
2007
IEEE

Optimality and Complexity of Opportunistic Spectrum Access: A Truncated Markov Decision Process Formulation

16 years 8 days ago

Download www.ece.ucdavis.edu

We consider opportunistic spectrum access (OSA) which allows secondary users to identify and exploit instantaneous spectrum opportunities resulting from the bursty traffic of ...

Dejan V. Djonin, Qing Zhao, Vikram Krishnamurthy

CAV
2010
Springer

Measuring and Synthesizing Systems in Probabilistic Environments

15 years 9 months ago

Download www-verimag.imag.fr

Often one has a preference order among the different systems that satisfy a given specification. Under a probabilistic assumption about the possible inputs, such a preference order...

Krishnendu Chatterjee, Thomas A. Henzinger, Barbar...

NECO
2007

Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule

15 years 5 months ago

Download eprints.pascal-network.org

Learning agents, whether natural or artificial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...

Dorit Baras, Ron Meir

NIPS
2007

Optimistic Linear Programming gives Logarithmic Regret for Irreducible MDPs

15 years 7 months ago

Download books.nips.cc

We present an algorithm called Optimistic Linear Programming (OLP) for learning to optimize average reward in an irreducible but otherwise unknown Markov decision process (MDP). O...

Ambuj Tewari, Peter L. Bartlett

JAIR
2008

Communication-Based Decomposition Mechanisms for Decentralized MDPs

15 years 6 months ago

Download anytime.cs.umass.edu

Multi-agent planning in stochastic environments can be framed formally as a decentralized Markov decision problem. Many real-life distributed problems that arise in manufacturing,...

Claudia V. Goldman, Shlomo Zilberstein
