Search Sciweavers | Sciweavers

499 search results - page 23 / 100

» Model Minimization in Markov Decision Processes

128

click to vote

UAI
2004

135views Artificial Intelligence» more UAI 2004»

Blind Construction of Optimal Nonlinear Recursive Predictors for Discrete Sequences

15 years 6 months ago

Download uai.sis.pitt.edu

We present a new method for nonlinear prediction of discrete random sequences under minimal structural assumptions. We give a mathematical construction for optimal predictors of s...

Cosma Rohilla Shalizi, Kristina Lisa Shalizi

claim paper

Read More »

116

click to vote

ICASSP
2011
IEEE

165views Signal Processing» more ICASSP 2011»

A modified MAP criterion based on hidden Markov model for voice activity detecion

14 years 8 months ago

Download mirlab.org

The maximum a posteriori (MAP) criterion is broadly used in the statistical model-based voice activity detection (VAD) approaches. In the conventional MAP criterion, however, the ...

Shiwen Deng, Jiqing Han, Tieran Zheng, Guibin Zhen...

claim paper

Read More »

177

click to vote

ICASSP
2011
IEEE

153views Signal Processing» more ICASSP 2011»

Reinforcement learning for energy-efficient wireless transmission

14 years 8 months ago

Download mirlab.org

We consider the problem of energy-efficient point-to-point transmission of delay-sensitive data (e.g. multimedia data) over a fading channel. We propose a rigorous and unified fra...

Nicholas Mastronarde, Mihaela van der Schaar

claim paper

Read More »

175

click to vote

ENTCS
2006

134views more ENTCS 2006»

Partial Order Reduction for Probabilistic Branching Time

15 years 5 months ago

Download www.win.tue.nl

In the past, partial order reduction has been used successfully to combat the state explosion problem in the context of model checking for non-probabilistic systems. For both line...

Christel Baier, Pedro R. D'Argenio, Marcus Grö...

claim paper

Read More »

119

click to vote

COLT
2000
Springer

87views Machine Learning» more COLT 2000»

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

15 years 9 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process ( ¢¡¤£¦¥§ ), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

claim paper

Read More »

« Prev « First page 23 / 100 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers