Search Sciweavers | Sciweavers

802 search results - page 45 / 161

» Experts in a Markov Decision Process

click to vote

QEST
2006
IEEE

162views Modeling and Simulation» more QEST 2006»

LiQuor: A tool for Qualitative and Quantitative Linear Time analysis of Reactive Systems

14 years 1 months ago

Download www.win.tue.nl

LiQuor is a tool for verifying probabilistic reactive systems modelled Probmela programs, which are terms of a probabilistic guarded command language with an operational semantics...

Frank Ciesinski, Christel Baier

claim paper

Read More »

click to vote

ICML
2005
IEEE

133views Machine Learning» more ICML 2005»

A theoretical analysis of Model-Based Interval Estimation

14 years 8 months ago

Download paul.rutgers.edu

Several algorithms for learning near-optimal policies in Markov Decision Processes have been analyzed and proven efficient. Empirical results have suggested that Model-based Inter...

Alexander L. Strehl, Michael L. Littman

claim paper

Read More »

click to vote

VMCAI
2010
Springer

204views Software Engineering» more VMCAI 2010»

Best Probabilistic Transformers

14 years 5 months ago

Download rw4.cs.uni-sb.de

This paper investigates relative precision and optimality of analyses for concurrent probabilistic systems. Aiming at the problem at the heart of probabilistic model checking ? com...

Björn Wachter, Lijun Zhang

claim paper

Read More »

click to vote

ICRA
2007
IEEE

134views Robotics» more ICRA 2007»

Grasping POMDPs

14 years 2 months ago

Download people.csail.mit.edu

Abstract— We provide a method for planning under uncertainty for robotic manipulation by partitioning the conﬁguration space into a set of regions that are closed under complia...

Kaijen Hsiao, Leslie Pack Kaelbling, Tomás ...

claim paper

Read More »

click to vote

COLT
2000
Springer

87views Machine Learning» more COLT 2000»

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

14 years 2 days ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process ( ¢¡¤£¦¥§ ), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

claim paper

Read More »

« Prev « First page 45 / 161 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers