Search Sciweavers | Sciweavers

499 search results - page 35 / 100

» Model Minimization in Markov Decision Processes

ATAL
2007
Springer

94 views, Intelligent Agents

Graphical models for online solutions to interactive POMDPs

15 years 10 months ago

Download www.cs.uga.edu

We develop a new graphical representation for interactive partially observable Markov decision processes (I-POMDPs) that is significantly more transparent and semantically clear t...

Prashant Doshi, Yifeng Zeng, Qiongyu Chen

ATAL
2007
Springer

112 views, Intelligent Agents

A globally optimal algorithm for TTD-MDPs

15 years 10 months ago

Download www.cc.gatech.edu

In this paper, we discuss the use of Targeted Trajectory Distribution Markov Decision Processes (TTD-MDPs), a variant of MDPs in which the goal is to realize a specified distrib...

Sooraj Bhat, David L. Roberts, Mark J. Nelson, Cha...

ATAL
2010
Springer

157 views, Intelligent Agents

Augmenting appearance-based localization and navigation using belief update

15 years 5 months ago

Download www.aamas-conference.org

Appearance-based localization compares the current image taken from a robot's camera to a set of pre-recorded images in order to estimate the current location of the robot. S...

George Chrysanthakopoulos, Guy Shani

COMPLEX
2009
Springer

109 views, Theoretical Computer Science

Non-sufficient Memories That Are Sufficient for Prediction

15 years 8 months ago

Download personal-homepages.mis.mpg.de

The causal states of computational mechanics define the minimal sufficient (prescient) memory for a given stationary stochastic process. They induce the ε-machine which is a hidden...

Wolfgang Löhr, Nihat Ay

ICML
2006
IEEE

131 views, Machine Learning

PAC model-free reinforcement learning

16 years 5 months ago

Download cseweb.ucsd.edu

For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm, Delayed Q-Learning. We prove it is PAC, achieving near o...

Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...
