Search Sciweavers | Sciweavers

267 search results - page 9 / 54

» Qualitative Analysis of Partially-Observable Markov Decision...

COLT
2000
Springer

Machine Learning

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

15 years 10 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process (POMDP), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

ICRA
2007
IEEE

Robotics

A formal framework for robot learning and control under model uncertainty

15 years 12 months ago

Download www.cs.mcgill.ca

While the Partially Observable Markov Decision Process (POMDP) provides a formal framework for the problem of robot control under uncertainty, it typically assumes a known and ...

Robin Jaulmes, Joelle Pineau, Doina Precup

AAAI
2007

Intelligent Agents

Situated Conversational Agents

15 years 8 months ago

Download www.aaai.org

A Situated Conversational Agent (SCA) is an agent that engages in dialog about the context within which it is embedded. Situated dialog is characterized by its deep connection to ...

William Thompson

UAI
2003

Artificial Intelligence

Optimal Limited Contingency Planning

15 years 7 months ago

Download ti.arc.nasa.gov

For a given problem, the optimal Markov policy over a finite horizon is a conditional plan containing a potentially large number of branches. However, there are applications wher...

Nicolas Meuleau, David E. Smith

CDC
2008
IEEE

Control Systems

Information state for Markov decision processes with network delays

16 years 4 days ago

Download wsl.stanford.edu

We consider a networked control system, where each subsystem evolves as a Markov decision process (MDP). Each subsystem is coupled to its neighbors via communication links over wh...

Sachin Adlakha, Sanjay Lall, Andrea J. Goldsmith
