Search Sciweavers | Sciweavers

177

AAAI
2011

145views Intelligent Agents» more AAAI 2011»

Policy Gradient Planning for Environmental Decision Making with Existing Simulators

14 years 5 months ago

In environmental and natural resource planning domains actions are taken at a large number of locations over multiple time periods. These problems have enormous state and action s...

Mark Crowley, David Poole

claim paper

Read More »

139

click to vote

CORR
2008
Springer

89views Education» more CORR 2008»

Flow Faster: Efficient Decision Algorithms for Probabilistic Simulations

15 years 5 months ago

Download www-i2.informatik.rwth-aachen.de

Strong and weak simulation relations have been proposed for Markov chains, while strong simulation and strong probabilistic simulation relations have been proposed for probabilisti...

Lijun Zhang, Holger Hermanns, Friedrich Eisenbrand...

claim paper

Read More »

124

click to vote

EPEW
2007
Springer

118views Internet Technology» more EPEW 2007»

Compositionality for Markov Reward Chains with Fast Transitions

15 years 9 months ago

Download alexandria.tue.nl

A parallel composition is defined for Markov reward chains with fast transitions and for discontinuous Markov reward chains. In this setting, compositionality with respect to the r...

Jasen Markovski, Ana Sokolova, Nikola Trcka, Erik ...

claim paper

Read More »

163

click to vote

TSMC
2008

146views more TSMC 2008»

Decentralized Learning in Markov Games

15 years 5 months ago

Download como.vub.ac.be

Learning Automata (LA) were recently shown to be valuable tools for designing Multi-Agent Reinforcement Learning algorithms. One of the principal contributions of LA theory is tha...

Peter Vrancx, Katja Verbeeck, Ann Nowé

claim paper

Read More »

169

click to vote

QEST
2010
IEEE

154views Modeling and Simulation» more QEST 2010»

Symblicit Calculation of Long-Run Averages for Concurrent Probabilistic Systems

15 years 3 months ago

Download www.informatik.uni-freiburg.de

Abstract--Model checkers for concurrent probabilistic systems have become very popular within the last decade. The study of long-run average behavior has however received only scan...

Ralf Wimmer, Bettina Braitling, Bernd Becker, Erns...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers