Search Sciweavers | Sciweavers

1106 search results - page 125 / 222

» On regularization algorithms in learning theory

136

click to vote

AIIA
2005
Springer

114views Artificial Intelligence» more AIIA 2005»

Experimental Evaluation of Hierarchical Hidden Markov Models

15 years 10 months ago

Download www.ugogalassi.net

Building proﬁles for processes and for interactive users is a important task in intrusion detection. This paper presents the results obtained with a Hierarchical Hidden Markov Mo...

Attilio Giordana, Ugo Galassi, Lorenza Saitta

claim paper

Read More »

140

click to vote

ICML
2010
IEEE

219views Machine Learning» more ICML 2010»

Convergence of Least Squares Temporal Difference Methods Under General Conditions

15 years 5 months ago

Download www.cs.helsinki.fi

We consider approximate policy evaluation for finite state and action Markov decision processes (MDP) in the off-policy learning context and with the simulation-based least square...

Huizhen Yu

claim paper

Read More »

122

click to vote

ICC
2007
IEEE

120views Communications» more ICC 2007»

Dynamic Network Selection using Kernels

15 years 11 months ago

Download www.prism.uvsq.fr

—We present a new algorithm for vertical handover and dynamic network selection, based on a combination of multiattribute utility theory, kernel learning and stochastic gradient ...

Eric van den Berg, Praveen Gopalakrishnan, Byungsu...

claim paper

Read More »

140

click to vote

ICML
2009
IEEE

98views Machine Learning» more ICML 2009»

On primal and dual sparsity of Markov networks

16 years 5 months ago

Download www.cs.cmu.edu

Sparsity is a desirable property in high dimensional learning. The 1-norm regularization can lead to primal sparsity, while max-margin methods achieve dual sparsity. Combining the...

Jun Zhu, Eric P. Xing

claim paper

Read More »

162

click to vote

ECML
2005
Springer

193views Machine Learning» more ECML 2005»

Natural Actor-Critic

15 years 10 months ago

Download www-clmc.usc.edu

This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...

Jan Peters, Sethu Vijayakumar, Stefan Schaal

claim paper

Read More »

« Prev « First page 125 / 222 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers