Search Sciweavers | Sciweavers

63 search results - page 6 / 13

» Mean field for Markov Decision Processes: from Discrete to C...

click to vote

ICASSP
2011
IEEE

184views Signal Processing» more ICASSP 2011»

Multi-view and multi-objective semi-supervised learning for large vocabulary continuous speech recognition

12 years 11 months ago

Download mirlab.org

Current hidden Markov acoustic modeling for large vocabulary continuous speech recognition (LVCSR) relies on the availability of abundant labeled transcriptions. Given that speech...

Xiaodong Cui, Jing Huang, Jen-Tzung Chien

claim paper

Read More »

click to vote

IPCO
2010

125views Optimization» more IPCO 2010»

A Pumping Algorithm for Ergodic Stochastic Mean Payoff Games with Perfect Information

13 years 9 months ago

Download www.mpi-inf.mpg.de

Abstract. We consider two-person zero-sum stochastic mean payoff games with perfect information, or BWR-games, given by a digraph G = (V = VB VW VR, E), with local rewards r : E R...

Endre Boros, Khaled M. Elbassioni, Vladimir Gurvic...

claim paper

Read More »

click to vote

NHM
2010

87views more NHM 2010»

The coolest path problem

13 years 2 months ago

Download www.hausdorff-research-institute.uni-bonn.de

We introduce the coolest path problem, which is a mixture of two well-known problems from distinct mathematical fields. One of them is the shortest path problem from combinatorial ...

Martin Frank, Armin Fügenschuh, Michael Herty...

claim paper

Read More »

click to vote

JMLR
2006

124views more JMLR 2006»

Policy Gradient in Continuous Time

13 years 7 months ago

Download hal.inria.fr

Policy search is a method for approximately solving an optimal control problem by performing a parametric optimization search in a given class of parameterized policies. In order ...

Rémi Munos

claim paper

Read More »

click to vote

ICML
2007
IEEE

204views Machine Learning» more ICML 2007»

Constructing basis functions from directed graphs for value function approximation

14 years 8 months ago

Download www.machinelearning.org

Basis functions derived from an undirected graph connecting nearby samples from a Markov decision process (MDP) have proven useful for approximating value functions. The success o...

Jeffrey Johns, Sridhar Mahadevan

claim paper

Read More »

« Prev « First page 6 / 13 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers