Sciweavers

373 search results - page 28 / 75
» Covariant Policy Search
Sort
View
ML
2002
ACM
133views Machine Learning» more  ML 2002»
13 years 9 months ago
Finite-time Analysis of the Multiarmed Bandit Problem
Reinforcement learning policies face the exploration versus exploitation dilemma, i.e. the search for a balance between exploring the environment to find profitable actions while t...
Peter Auer, Nicolò Cesa-Bianchi, Paul Fisch...
ECCV
2006
Springer
14 years 12 months ago
Density Estimation Using Mixtures of Mixtures of Gaussians
In this paper we present a new density estimation algorithm using mixtures of mixtures of Gaussians. The new algorithm overcomes the limitations of the popular Expectation Maximiza...
Wael Abd-Almageed, Larry S. Davis
CEC
2009
IEEE
14 years 2 months ago
Memory-enhanced Evolutionary Robotics: The Echo State Network Approach
— Interested in Evolutionary Robotics, this paper focuses on the acquisition and exploitation of memory skills. The targeted task is a well-studied benchmark problem, the Tolman ...
Cédric Hartland, Nicolas Bredeche, Mich&egr...
ICASSP
2010
IEEE
13 years 10 months ago
A novel estimation of feature-space MLLR for full-covariance models
In this paper we present a novel approach for estimating featurespace maximum likelihood linear regression (fMLLR) transforms for full-covariance Gaussian models by directly maxim...
Arnab Ghoshal, Daniel Povey, Mohit Agarwal, Pinar ...
WWW
2009
ACM
14 years 10 months ago
User-centric content freshness metrics for search engines
In order to return relevant search results, a search engine must keep its local repository synchronized to the Web, but it is usually impossible to attain perfect freshness. Hence...
Ali Dasdan, Xinh Huynh