Search Sciweavers | Sciweavers

135 search results - page 7 / 27

» Bounded Parameter Markov Decision Processes

176

Voted

IJCAI
2007

147views Artificial Intelligence» more IJCAI 2007»

The Value of Observation for Monitoring Dynamic Systems

15 years 9 months ago

Download ijcai.org

We consider the fundamental problem of monitoring (i.e. tracking) the belief state in a dynamic system, when the model is only approximately correct and when the initial belief st...

Eyal Even-Dar, Sham M. Kakade, Yishay Mansour

claim paper

Read More »

214

click to vote

CORR
2008
Springer

189views Education» more CORR 2008»

Algorithms for Dynamic Spectrum Access with Learning for Cognitive Radio

15 years 7 months ago

Download www.ifp.illinois.edu

We study the problem of dynamic spectrum sensing and access in cognitive radio systems as a partially observed Markov decision process (POMDP). A group of cognitive users cooperati...

Jayakrishnan Unnikrishnan, Venugopal V. Veeravalli

claim paper

Read More »

221

click to vote

IROS
2009
IEEE

206views Robotics» more IROS 2009»

Bayesian reinforcement learning in continuous POMDPs with gaussian processes

16 years 2 months ago

Download www.cs.cmu.edu

— Partially Observable Markov Decision Processes (POMDPs) provide a rich mathematical model to handle realworld sequential decision processes but require a known model to be solv...

Patrick Dallaire, Camille Besse, Stéphane R...

claim paper

Read More »

210

Voted

CDC
2009
IEEE

134views Control Systems» more CDC 2009»

Event-based control using quadratic approximate value functions

16 years 6 days ago

Download people.virginia.edu

Abstract— In this paper we consider several problems involving control with limited actuation and sampling rates. Event-based control has emerged as an attractive approach for ad...

Randy Cogill

claim paper

Read More »

284

click to vote

CSL
2012
Springer

311views Automated Reasoning» more CSL 2012»

Reinforcement learning for parameter estimation in statistical spoken dialogue systems

14 years 3 months ago

Download mi.eng.cam.ac.uk

Reinforcement techniques have been successfully used to maximise the expected cumulative reward of statistical dialogue systems. Typically, reinforcement learning is used to estim...

Filip Jurcícek, Blaise Thomson, Steve Young

claim paper

Read More »

« Prev « First page 7 / 27 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers