Search Sciweavers | Sciweavers

260 search results - page 40 / 52

» Quasi-Deterministic Partially Observable Markov Decision Pro...

NIPS
2007

What makes some POMDP problems easy to approximate?

13 years 10 months ago

Download books.nips.cc

Point-based algorithms have been surprisingly successful in computing approximately optimal solutions for partially observable Markov decision processes (POMDPs) in high dimension...

David Hsu, Wee Sun Lee, Nan Rong


AIPS
2008

HiPPo: Hierarchical POMDPs for Planning Information Processing and Sensing Actions on a Robot

13 years 11 months ago

Download www.cs.bham.ac.uk

Flexible general purpose robots need to tailor their visual processing to their task, on the fly. We propose a new approach to this within a planning framework, where the goal is ...

Mohan Sridharan, Jeremy L. Wyatt, Richard Dearden


SBMF
2009
Springer

Undecidability Results for Distributed Probabilistic Systems

14 years 1 month ago

Download www.cs.famaf.unc.edu.ar

Abstract. In the verification of concurrent systems involving probabilities, the aim is to find out the maximum/minimum probability that a given event occurs (examples of such ev...

Sergio Giro


IJFCS
2008

Equivalence of Labeled Markov Chains

13 years 8 months ago

Download mtc.epfl.ch

We consider the equivalence problem for labeled Markov chains (LMCs), where each state is labeled with an observation. Two LMCs are equivalent if every finite sequence of observat...

Laurent Doyen, Thomas A. Henzinger, Jean-Franç...


ML
2002
ACM

Near-Optimal Reinforcement Learning in Polynomial Time

13 years 8 months ago

Download www.cis.upenn.edu

We present new algorithms for reinforcement learning, and prove that they have polynomial bounds on the resources required to achieve near-optimal return in general Markov decisio...

Michael J. Kearns, Satinder P. Singh

