Search Sciweavers | Sciweavers

771 search results - page 95 / 155

» Markov Decision Processes with Arbitrary Reward Processes

140

click to vote

AIPS
1998

127views Artificial Intelligence» more AIPS 1998»

Solving Stochastic Planning Problems with Large State and Action Spaces

15 years 5 months ago

Download www.cs.brown.edu

Planning methods for deterministic planning problems traditionally exploit factored representations to encode the dynamics of problems in terms of a set of parameters, e.g., the l...

Thomas Dean, Robert Givan, Kee-Eung Kim

claim paper

Read More »

137

click to vote

SBMF
2009
Springer

126views Formal Methods» more SBMF 2009»

Undecidability Results for Distributed Probabilistic Systems

15 years 8 months ago

Download www.cs.famaf.unc.edu.ar

Abstract. In the veriﬁcation of concurrent systems involving probabilities, the aim is to ﬁnd out the maximum/minimum probability that a given event occurs (examples of such ev...

Sergio Giro

claim paper

Read More »

123

click to vote

TSP
2010

119views Artificial Intelligence» more TSP 2010»

Universal randomized switching

14 years 10 months ago

Download www.ifp.illinois.edu

Abstract--In this paper, we consider a competitive approach to sequential decision problems, suitable for a variety of signal processing applications where at each of a succession ...

Suleyman Serdar Kozat, Andrew C. Singer

claim paper

Read More »

163

click to vote

UAI
2000

91views Artificial Intelligence» more UAI 2000»

Value-Directed Belief State Approximation for POMDPs

15 years 5 months ago

Download www.cs.uwaterloo.ca

We consider the problem belief-state monitoring for the purposes of implementing a policy for a partially-observable Markov decision process (POMDP), specifically how one might ap...

Pascal Poupart, Craig Boutilier

claim paper

Read More »

124

click to vote

JMLR
2010

125views more JMLR 2010»

Variational methods for Reinforcement Learning

14 years 10 months ago

Download jmlr.csail.mit.edu

We consider reinforcement learning as solving a Markov decision process with unknown transition distribution. Based on interaction with the environment, an estimate of the transit...

Thomas Furmston, David Barber

claim paper

Read More »

« Prev « First page 95 / 155 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers