Search Sciweavers | Sciweavers

27 search results - page 3 / 6

» Compositionality for Markov Reward Chains with Fast Transiti...

168

click to vote

TSMC
2008

146views more TSMC 2008»

Decentralized Learning in Markov Games

15 years 6 months ago

Download como.vub.ac.be

Learning Automata (LA) were recently shown to be valuable tools for designing Multi-Agent Reinforcement Learning algorithms. One of the principal contributions of LA theory is tha...

Peter Vrancx, Katja Verbeeck, Ann Nowé

claim paper

Read More »

184

click to vote

AAAI
2011

145views Intelligent Agents» more AAAI 2011»

Policy Gradient Planning for Environmental Decision Making with Existing Simulators

14 years 6 months ago

Download www.cs.ubc.ca

In environmental and natural resource planning domains actions are taken at a large number of locations over multiple time periods. These problems have enormous state and action s...

Mark Crowley, David Poole

claim paper

Read More »

173

click to vote

QEST
2005
IEEE

137views Modeling and Simulation» more QEST 2005»

iLTLChecker: A Probabilistic Model Checker for Multiple DTMCs

15 years 11 months ago

Download osl.cs.uiuc.edu

iLTL is a probabilistic temporal logic that can specify properties of multiple discrete time Markov chains (DTMCs). In this paper, we describe two related tools: MarkovEstimator a...

YoungMin Kwon, Gul A. Agha

claim paper

Read More »

163

click to vote

ATAL
2008
Springer

136views Intelligent Agents» more ATAL 2008»

Interaction-driven Markov games for decentralized multiagent planning under uncertainty

15 years 8 months ago

Download users.isr.ist.utl.pt

In this paper we propose interaction-driven Markov games (IDMGs), a new model for multiagent decision making under uncertainty. IDMGs aim at describing multiagent decision problem...

Matthijs T. J. Spaan, Francisco S. Melo

claim paper

Read More »

188

click to vote

CORR
2007
Springer

143views Education» more CORR 2007»

On Myopic Sensing for Multi-Channel Opportunistic Access

15 years 6 months ago

Download www.ece.ucdavis.edu

We consider a multi-channel opportunistic communication system where the states of these channels evolve as independent and statistically identical Markov chains (the Gilbert-Elli...

Qing Zhao, Bhaskar Krishnamachari, Keqin Liu

claim paper

Read More »

« Prev « First page 3 / 6 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers