Search Sciweavers | Sciweavers

267 search results - page 40 / 54

» Qualitative Analysis of Partially-Observable Markov Decision...

AAAI
2006

146 views, Intelligent Agents

Incremental Least Squares Policy Iteration for POMDPs

15 years 7 months ago

Download www.aaai.org

We present a new algorithm, called incremental least squares policy iteration (ILSPI), for finding the infinite-horizon stationary policy for partially observable Markov decision ...

Hui Li, Xuejun Liao, Lawrence Carin

NIPS
2001

144 views, Information Technology

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 7 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

NIPS
2003

145 views, Information Technology

A Nonlinear Predictive State Representation

15 years 7 months ago

Download books.nips.cc

Predictive state representations (PSRs) use predictions of a set of tests to represent the state of controlled dynamical systems. One reason why this representation is exciting as...

Matthew R. Rudary, Satinder P. Singh

CSL
2010
Springer

152 views, Automated Reasoning

The Hidden Information State model: A practical framework for POMDP-based spoken dialogue management

15 years 5 months ago

Download mi.eng.cam.ac.uk

This paper explains how Partially Observable Markov Decision Processes (POMDPs) can provide a principled mathematical framework for modelling the inherent uncertainty in spoken di...

Steve Young, Milica Gasic, Simon Keizer, Franç...

JAIR
2010

115 views

An Investigation into Mathematical Programming for Finite Horizon Decentralized POMDPs

15 years 4 months ago

Download www.jair.org

Decentralized planning in uncertain environments is a complex task generally dealt with by using a decision-theoretic approach, mainly through the framework of Decentralized Parti...

Raghav Aras, Alain Dutech
