Search Sciweavers | Sciweavers

231 search results - page 23 / 47

» Active Learning in Partially Observable Markov Decision Proc...

ICASSP
2011
IEEE

153 views, Signal Processing

Reinforcement learning for energy-efficient wireless transmission

14 years 11 months ago

Download mirlab.org

We consider the problem of energy-efficient point-to-point transmission of delay-sensitive data (e.g. multimedia data) over a fading channel. We propose a rigorous and unified fra...

Nicholas Mastronarde, Mihaela van der Schaar

ECML
2007
Springer

192 views, Machine Learning

Policy Gradient Critics

16 years 1 month ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

ATAL
2007
Springer

145 views, Intelligent Agents

Interactive dynamic influence diagrams

15 years 11 months ago

Download www.sci.brooklyn.cuny.edu

This paper extends the framework of dynamic influence diagrams (DIDs) to the multi-agent setting. DIDs are computational representations of the Partially Observable Markov Decisio...

Kyle Polich, Piotr J. Gmytrasiewicz

ACL
2010

175 views, Computational Linguistics

Towards Relational POMDPs for Adaptive Dialogue Management

15 years 5 months ago

Download aclweb.org

Open-ended spoken interactions are typically characterised by both structural complexity and high levels of uncertainty, making dialogue management in such settings a particularly...

Pierre Lison

ICMLA
2004

114 views, Machine Learning

Planning with predictive state representations

15 years 8 months ago

Download www.eecs.umich.edu

Predictive state representation (PSR) models for controlled dynamical systems have recently been proposed as an alternative to traditional models such as partially observable Mark...

Michael R. James, Satinder P. Singh, Michael L. Li...
