Sciweavers

2415 search results - page 189 / 483
» Markov Processes on Curves
Sort
View
ICML
2006
IEEE
14 years 11 months ago
PAC model-free reinforcement learning
For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm--Delayed Q-Learning. We prove it is PAC, achieving near o...
Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...
ICML
2006
IEEE
14 years 11 months ago
An intrinsic reward mechanism for efficient exploration
How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal policy for later use? In other words, how should it explore, to be able to exp...
Özgür Simsek, Andrew G. Barto
CAV
2009
Springer
156views Hardware» more  CAV 2009»
14 years 5 months ago
Towards Performance Prediction of Compositional Models in Industrial GALS Designs
Systems and Networks on Chips (NoCs) are a prime design focus of many hardware manufacturers. In addition to functional verification, which is a difficult necessity, the chip desi...
Nicolas Coste, Holger Hermanns, Etienne Lantreibec...
ICASSP
2009
IEEE
14 years 5 months ago
Bayesian sparse image reconstruction for MRFM
In this paper, we propose a Bayesian model and a Monte Carlo Markov chain (MCMC) algorithm for reconstructing images that consist of only few non-zero pixels. An appropriate distr...
Nicolas Dobigeon, Alfred O. Hero, Jean-Yves Tourne...
ICASSP
2009
IEEE
14 years 5 months ago
Generalized Baum-Welch algorithm for discriminative training on large vocabulary continuous speech recognition system
We propose a new optimization algorithm called Generalized Baum Welch (GBW) algorithm for discriminative training on hidden Markov model (HMM). GBW is based on Lagrange relaxation...
Roger Hsiao, Yik-Cheung Tam, Tanja Schultz