Search Sciweavers | Sciweavers

2415 search results - page 189 / 483

» Markov Processes on Curves

ICML
2006
IEEE

131 views, Machine Learning

PAC model-free reinforcement learning

16 years 6 months ago

Download cseweb.ucsd.edu

For a Markov Decision Process with finite state space (size S) and finite action space (size A per state), we propose a new algorithm, Delayed Q-Learning. We prove it is PAC, achieving near o...

Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...

ICML
2006
IEEE

142 views, Machine Learning

An intrinsic reward mechanism for efficient exploration

16 years 6 months ago

Download www-anw.cs.umass.edu

How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal policy for later use? In other words, how should it explore, to be able to exp...

Özgür Simsek, Andrew G. Barto

CAV
2009
Springer

156 views, Hardware

Towards Performance Prediction of Compositional Models in Industrial GALS Designs

15 years 12 months ago

Download ftp.inrialpes.fr

Systems and Networks on Chips (NoCs) are a prime design focus of many hardware manufacturers. In addition to functional verification, which is a difficult necessity, the chip desi...

Nicolas Coste, Holger Hermanns, Etienne Lantreibec...

ICASSP
2009
IEEE

128 views, Signal Processing

Bayesian sparse image reconstruction for MRFM

15 years 12 months ago

Download www.eecs.umich.edu

In this paper, we propose a Bayesian model and a Monte Carlo Markov chain (MCMC) algorithm for reconstructing images that consist of only a few non-zero pixels. An appropriate distr...

Nicolas Dobigeon, Alfred O. Hero, Jean-Yves Tourne...

ICASSP
2009
IEEE

152 views, Signal Processing

Generalized Baum-Welch algorithm for discriminative training on large vocabulary continuous speech recognition system

15 years 12 months ago

Download www.cs.cmu.edu

We propose a new optimization algorithm, the Generalized Baum-Welch (GBW) algorithm, for discriminative training of hidden Markov models (HMMs). GBW is based on Lagrange relaxation...

Roger Hsiao, Yik-Cheung Tam, Tanja Schultz
