Search Sciweavers | Sciweavers

771 search results - page 56 / 155

» Markov Decision Processes with Arbitrary Reward Processes

130

click to vote

CORR
2010
Springer

112views Education» more CORR 2010»

Efficient Approximation of Optimal Control for Markov Games

15 years 4 months ago

Download react.cs.uni-sb.de

The success of probabilistic model checking for discrete-time Markov decision processes and continuous-time Markov chains has led to rich academic and industrial applications. The ...

Markus Rabe, Sven Schewe, Lijun Zhang

claim paper

Read More »

128

click to vote

ICTAI
1996
IEEE

112views Artificial Intelligence» more ICTAI 1996»

Incremental Markov-Model Planning

15 years 8 months ago

Download reference.kfupm.edu.sa

This paper presents an approach to building plans using partially observable Markov decision processes. The approach begins with a base solution that assumes full observability. T...

Richard Washington

claim paper

Read More »

142

Voted

ICML
1994
IEEE

152views Machine Learning» more ICML 1994»

Markov Games as a Framework for Multi-Agent Reinforcement Learning

15 years 7 months ago

Download www.cs.rutgers.edu

In the Markov decision process (MDP) formalization of reinforcement learning, a single adaptive agent interacts with an environment defined by a probabilistic transition function....

Michael L. Littman

claim paper

Read More »

133

click to vote

JAIR
2006

122views more JAIR 2006»

Solving Factored MDPs with Hybrid State and Action Variables

15 years 4 months ago

Download www.jair.org

Efficient representations and solutions for large decision problems with continuous and discrete variables are among the most important challenges faced by the designers of automa...

Branislav Kveton, Milos Hauskrecht, Carlos Guestri...

claim paper

Read More »

142

click to vote

FSTTCS
2006
Springer

149views Software Engineering» more FSTTCS 2006»

Testing Probabilistic Equivalence Through Reinforcement Learning

15 years 7 months ago

Download www2.ift.ulaval.ca

We propose a new approach to verification of probabilistic processes for which the model may not be available. We use a technique from Reinforcement Learning to approximate how far...

Josee Desharnais, François Laviolette, Sami...

claim paper

Read More »

« Prev « First page 56 / 155 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers