Search Sciweavers | Sciweavers

1414 search results - page 200 / 283

» Randomness and Universal Machines

132

click to vote

ICML
2009
IEEE

222views Machine Learning» more ICML 2009»

Unsupervised hierarchical modeling of locomotion styles

16 years 4 months ago

Download www.cs.dartmouth.edu

This paper describes an unsupervised learning technique for modeling human locomotion styles, such as distinct related activities (e.g. running and striding) or variations of the ...

Wei Pan, Lorenzo Torresani

claim paper

Read More »

120

click to vote

ICML
2007
IEEE

153views Machine Learning» more ICML 2007»

Comparisons of sequence labeling algorithms and extensions

16 years 4 months ago

Download www.machinelearning.org

In this paper, we survey the current state-ofart models for structured learning problems, including Hidden Markov Model (HMM), Conditional Random Fields (CRF), Averaged Perceptron...

Nam Nguyen, Yunsong Guo

claim paper

Read More »

101

Voted

ICML
2007
IEEE

107views Machine Learning» more ICML 2007»

Online discovery of similarity mappings

16 years 4 months ago

Download www.machinelearning.org

We consider the problem of choosing, sequentially, a map which assigns elements of a set A to a few elements of a set B. On each round, the algorithm suffers some cost associated ...

Alexander Rakhlin, Jacob Abernethy, Peter L. Bartl...

claim paper

Read More »

156

click to vote

ICML
2007
IEEE

200views Machine Learning» more ICML 2007»

Multi-task reinforcement learning: a hierarchical Bayesian approach

16 years 4 months ago

Download www.machinelearning.org

We consider the problem of multi-task reinforcement learning, where the agent needs to solve a sequence of Markov Decision Processes (MDPs) chosen randomly from a fixed but unknow...

Aaron Wilson, Alan Fern, Soumya Ray, Prasad Tadepa...

claim paper

Read More »

140

click to vote

ICML
2007
IEEE

136views Machine Learning» more ICML 2007»

Combining online and offline knowledge in UCT

16 years 4 months ago

Download www.machinelearning.org

The UCT algorithm learns a value function online using sample-based search. The TD() algorithm can learn a value function offline for the on-policy distribution. We consider three...

Sylvain Gelly, David Silver

claim paper

Read More »

« Prev « First page 200 / 283 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers