Search Sciweavers | Sciweavers

262 search results - page 11 / 53

» Bounded-Parameter Partially Observable Markov Decision Proce...

248

click to vote

ACL
2010

175views Computational Linguistics» more ACL 2010»

Towards Relational POMDPs for Adaptive Dialogue Management

15 years 5 months ago

Download aclweb.org

Open-ended spoken interactions are typically characterised by both structural complexity and high levels of uncertainty, making dialogue management in such settings a particularly...

Pierre Lison

claim paper

Read More »

224

click to vote

UAI
2003

104views Artificial Intelligence» more UAI 2003»

Optimal Limited Contingency Planning

15 years 8 months ago

Download ti.arc.nasa.gov

For a given problem, the optimal Markov policy over a ﬁnite horizon is a conditional plan containing a potentially large number of branches. However, there are applications wher...

Nicolas Meuleau, David E. Smith

claim paper

Read More »

216

click to vote

ICRA
2007
IEEE

134views Robotics» more ICRA 2007»

Grasping POMDPs

16 years 1 months ago

Download people.csail.mit.edu

Abstract— We provide a method for planning under uncertainty for robotic manipulation by partitioning the conﬁguration space into a set of regions that are closed under complia...

Kaijen Hsiao, Leslie Pack Kaelbling, Tomás ...

claim paper

Read More »

162

click to vote

COLT
2000
Springer

87views Machine Learning» more COLT 2000»

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

15 years 11 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process ( ¢¡¤£¦¥§ ), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

claim paper

Read More »

204

click to vote

ICTAI
2000
IEEE

186views Artificial Intelligence» more ICTAI 2000»

Building efficient partial plans using Markov decision processes

15 years 11 months ago

Download ccc.inaoep.mx

Markov Decision Processes (MDP) have been widely used as a framework for planning under uncertainty. They allow to compute optimal sequences of actions in order to achieve a given...

Pierre Laroche

claim paper

Read More »

« Prev « First page 11 / 53 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers