Search Sciweavers | Sciweavers

231 search results - page 18 / 47

» Active Learning in Partially Observable Markov Decision Proc...

283

Voted

CSL
2012
Springer

311views Automated Reasoning» more CSL 2012»

Reinforcement learning for parameter estimation in statistical spoken dialogue systems

14 years 3 months ago

Download mi.eng.cam.ac.uk

Reinforcement techniques have been successfully used to maximise the expected cumulative reward of statistical dialogue systems. Typically, reinforcement learning is used to estim...

Filip Jurcícek, Blaise Thomson, Steve Young

claim paper

Read More »

216

Voted

ICRA
2008
IEEE

173views Robotics» more ICRA 2008»

Bayesian reinforcement learning in continuous POMDPs with application to robot navigation

16 years 1 months ago

Download www.cs.cmu.edu

— We consider the problem of optimal control in continuous and partially observable environments when the parameters of the model are not known exactly. Partially Observable Mark...

Stéphane Ross, Brahim Chaib-draa, Joelle Pi...

claim paper

Read More »

176

Voted

IJCAI
2007

147views Artificial Intelligence» more IJCAI 2007»

The Value of Observation for Monitoring Dynamic Systems

15 years 8 months ago

Download ijcai.org

We consider the fundamental problem of monitoring (i.e. tracking) the belief state in a dynamic system, when the model is only approximately correct and when the initial belief st...

Eyal Even-Dar, Sham M. Kakade, Yishay Mansour

claim paper

Read More »

193

click to vote

NIPS
2004

125views Information Technology» more NIPS 2004»

VDCBPI: an Approximate Scalable Algorithm for Large POMDPs

15 years 8 months ago

Download books.nips.cc

Existing algorithms for discrete partially observable Markov decision processes can at best solve problems of a few thousand states due to two important sources of intractability:...

Pascal Poupart, Craig Boutilier

claim paper

Read More »

182

click to vote

AAAI
2007

117views Intelligent Agents» more AAAI 2007»

Optimizing Anthrax Outbreak Detection Using Reinforcement Learning

15 years 9 months ago

Download www.aaai.org

The potentially catastrophic impact of a bioterrorist attack makes developing effective detection methods essential for public health. In the case of anthrax attack, a delay of ho...

Masoumeh T. Izadi, David L. Buckeridge

claim paper

Read More »

« Prev « First page 18 / 47 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers