Search Sciweavers | Sciweavers

262 search results - page 26 / 53

» Bounded-Parameter Partially Observable Markov Decision Proce...

ICCD
2006
IEEE

Stochastic Dynamic Thermal Management: A Markovian Decision-based Approach

16 years 4 months ago

Download atrak.usc.edu

This paper proposes a stochastic dynamic thermal management (DTM) technique for high-performance VLSI systems, with special attention to the uncertainty in temperature observation. ...

Hwisung Jung, Massoud Pedram

IJCAI
2007

Learning from Partial Observations

15 years 9 months ago

Download www.ijcai.org

We present a general machine learning framework for modelling the phenomenon of missing information in data. We propose a masking process model to capture the stochastic nature of...

Loizos Michael

IJCAI
2003

Taming Decentralized POMDPs: Towards Efficient Policy Computation for Multiagent Settings

15 years 8 months ago

Download dli.iiit.ac.in

The problem of deriving joint policies for a group of agents that maximize some joint reward function can be modeled as a decentralized partially observable Markov decision proces...

Ranjit Nair, Milind Tambe, Makoto Yokoo, David V. ...

JAIR
2006

Anytime Point-Based Approximations for Large POMDPs

15 years 7 months ago

Download www.jair.org

The Partially Observable Markov Decision Process has long been recognized as a rich framework for real-world planning and control problems, especially in robotics. However exact s...

Joelle Pineau, Geoffrey J. Gordon, Sebastian Thrun

CSL
2012
Springer

Reinforcement learning for parameter estimation in statistical spoken dialogue systems

14 years 3 months ago

Download mi.eng.cam.ac.uk

Reinforcement techniques have been successfully used to maximise the expected cumulative reward of statistical dialogue systems. Typically, reinforcement learning is used to estim...

Filip Jurčíček, Blaise Thomson, Steve Young
