Search Sciweavers | Sciweavers

135 search results - page 12 / 27

» Bounded Parameter Markov Decision Processes

143

click to vote

ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

PAC model-free reinforcement learning

16 years 6 months ago

Download cseweb.ucsd.edu

For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm--Delayed Q-Learning. We prove it is PAC, achieving near o...

Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...

claim paper

Read More »

150

click to vote

JMLR
2006

143views more JMLR 2006»

Geometric Variance Reduction in Markov Chains: Application to Value Function and Gradient Estimation

15 years 5 months ago

Download www.aaai.org

We study a sequential variance reduction technique for Monte Carlo estimation of functionals in Markov Chains. The method is based on designing sequential control variates using s...

Rémi Munos

claim paper

Read More »

163

click to vote

ICASSP
2008
IEEE

163views Signal Processing» more ICASSP 2008»

Link throughput of multi-channel opportunistic access with limited sensing

16 years 21 hour ago

Download www.ece.ucdavis.edu

—We aim to characterize the maximum link throughput of a multi-channel opportunistic communication system. The states of these channels evolve as independent and identically dist...

Keqin Liu, Qing Zhao

claim paper

Read More »

167

click to vote

PKDD
2010
Springer

129views Data Mining» more PKDD 2010»

Smarter Sampling in Model-Based Bayesian Reinforcement Learning

15 years 4 months ago

Download www.cs.mcgill.ca

Abstract. Bayesian reinforcement learning (RL) is aimed at making more efﬁcient use of data samples, but typically uses signiﬁcantly more computation. For discrete Markov Decis...

Pablo Samuel Castro, Doina Precup

claim paper

Read More »

156

click to vote

CORR
2008
Springer

107views Education» more CORR 2008»

A Spectral Algorithm for Learning Hidden Markov Models

15 years 5 months ago

Download www.cs.mcgill.ca

Hidden Markov Models (HMMs) are one of the most fundamental and widely used statistical tools for modeling discrete time series. In general, learning HMMs from data is computation...

Daniel Hsu, Sham M. Kakade, Tong Zhang

claim paper

Read More »

« Prev « First page 12 / 27 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers