Search Sciweavers | Sciweavers

262 search results - page 43 / 53

» Bounded-Parameter Partially Observable Markov Decision Proce...

177

click to vote

AAAI
2010

163views Intelligent Agents» more AAAI 2010»

Structured Parameter Elicitation

15 years 9 months ago

Download motion.comp.nus.edu.sg

The behavior of a complex system often depends on parameters whose values are unknown in advance. To operate effectively, an autonomous agent must actively gather information on t...

Li Ling Ko, David Hsu, Wee Sun Lee, Sylvie C. W. O...

claim paper

Read More »

202

click to vote

AAAI
2006

146views Intelligent Agents» more AAAI 2006»

Incremental Least Squares Policy Iteration for POMDPs

15 years 9 months ago

Download www.aaai.org

We present a new algorithm, called incremental least squares policy iteration (ILSPI), for finding the infinite-horizon stationary policy for partially observable Markov decision ...

Hui Li, Xuejun Liao, Lawrence Carin

claim paper

Read More »

182

click to vote

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 8 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

193

Voted

ATAL
2010
Springer

157views Intelligent Agents» more ATAL 2010»

Augmenting appearance-based localization and navigation using belief update

15 years 8 months ago

Download www.aamas-conference.org

Appearance-based localization compares the current image taken from a robot's camera to a set of pre-recorded images in order to estimate the current location of the robot. S...

George Chrysanthakopoulos, Guy Shani

claim paper

Read More »

203

click to vote

CSL
2010
Springer

152views Automated Reasoning» more CSL 2010»

The Hidden Information State model: A practical framework for POMDP-based spoken dialogue management

15 years 7 months ago

Download mi.eng.cam.ac.uk

This paper explains how Partially Observable Markov Decision Processes (POMDPs) can provide a principled mathematical framework for modelling the inherent uncertainty in spoken di...

Steve Young, Milica Gasic, Simon Keizer, Fran&cced...

claim paper

Read More »

« Prev « First page 43 / 53 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers