Sciweavers

262 search results - page 11 / 53
» Bounded-Parameter Partially Observable Markov Decision Proce...
Sort
View
ACL
2010
15 years 1 months ago
Towards Relational POMDPs for Adaptive Dialogue Management
Open-ended spoken interactions are typically characterised by both structural complexity and high levels of uncertainty, making dialogue management in such settings a particularly...
Pierre Lison
141
Voted
UAI
2003
15 years 5 months ago
Optimal Limited Contingency Planning
For a given problem, the optimal Markov policy over a finite horizon is a conditional plan containing a potentially large number of branches. However, there are applications wher...
Nicolas Meuleau, David E. Smith
ICRA
2007
IEEE
134views Robotics» more  ICRA 2007»
15 years 10 months ago
Grasping POMDPs
Abstract— We provide a method for planning under uncertainty for robotic manipulation by partitioning the configuration space into a set of regions that are closed under complia...
Kaijen Hsiao, Leslie Pack Kaelbling, Tomás ...
113
Voted
COLT
2000
Springer
15 years 8 months ago
Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning
We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process (  ¢¡¤£¦¥§  ), and focus on gradient ascent approache...
Peter L. Bartlett, Jonathan Baxter
141
Voted
ICTAI
2000
IEEE
15 years 7 months ago
Building efficient partial plans using Markov decision processes
Markov Decision Processes (MDP) have been widely used as a framework for planning under uncertainty. They allow to compute optimal sequences of actions in order to achieve a given...
Pierre Laroche