Sciweavers

» Sublinear Optimization for Machine Learning
ICML 2009 (IEEE)
Learning when to stop thinking and do something!
An anytime algorithm is capable of returning a response to the given task at essentially any time; typically the quality of the response improves as the time increases. Here, we c...
Barnabás Póczos, Csaba Szepesvári...
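
A minimal sketch of the anytime idea this abstract describes (not the paper's stopping rule): an iterative estimator that returns its current best answer whenever the caller's time budget expires, and improves if given more time. The function name and the Leibniz-series example are illustrative assumptions.

import time

def anytime_pi(budget_seconds):
    # Leibniz-series estimate of pi that can be cut off at any time:
    # the longer it runs, the better the returned estimate.
    estimate, sign, k = 0.0, 1.0, 0
    deadline = time.monotonic() + budget_seconds
    while time.monotonic() < deadline:
        estimate += sign * 4.0 / (2 * k + 1)
        sign, k = -sign, k + 1
    return estimate  # best answer available within the budget

print(anytime_pi(0.01))  # coarse estimate
print(anytime_pi(0.5))   # finer estimate given more time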
ICML 2009 (IEEE)
Robust feature extraction via information theoretic learning
In this paper, we present a robust feature extraction framework based on information-theoretic learning. Its formulated objective aims to simultaneously maximize the Rényi's...
Xiaotong Yuan, Bao-Gang Hu
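
For context on the truncated objective: information-theoretic learning commonly estimates Rényi's quadratic entropy with a Parzen (Gaussian-kernel) window, H2(X) ≈ -log((1/N^2) Σ_ij G_{σ√2}(x_i - x_j)). The sketch below implements that standard estimator, not the authors' exact formulation; the function name and kernel width are assumptions.

import numpy as np

def renyi_quadratic_entropy(x, sigma=1.0):
    # Parzen-window estimate of Renyi's quadratic entropy for 1-D samples x.
    x = np.asarray(x, dtype=float)
    n = len(x)
    diffs = x[:, None] - x[None, :]
    s2 = 2.0 * sigma ** 2                                   # pairwise kernel variance
    gauss = np.exp(-diffs ** 2 / (2 * s2)) / np.sqrt(2 * np.pi * s2)
    information_potential = gauss.sum() / n ** 2            # mean pairwise kernel value
    return -np.log(information_potential)

print(renyi_quadratic_entropy(np.random.randn(200)))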
ECML 2005 (Springer)
Active Learning for Probability Estimation Using Jensen-Shannon Divergence
Active selection of good training examples is an important approach to reducing data-collection costs in machine learning; however, most existing methods focus on maximizing classi...
Prem Melville, Stewart M. Yang, Maytal Saar-Tsecha...
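
The divergence named in the title is straightforward to compute; a minimal sketch follows, assuming the common usage of scoring how strongly two class-probability estimates for an unlabeled example disagree (the paper's exact selection criterion is not reproduced here).

import numpy as np

def js_divergence(p, q, eps=1e-12):
    # Jensen-Shannon divergence (in nats): 0.5*KL(p||m) + 0.5*KL(q||m), m = (p+q)/2.
    p = np.asarray(p, dtype=float) + eps
    q = np.asarray(q, dtype=float) + eps
    p, q = p / p.sum(), q / q.sum()
    m = 0.5 * (p + q)
    kl = lambda a, b: float(np.sum(a * np.log(a / b)))
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)

# e.g. two ensemble members disagreeing about one unlabeled example
print(js_divergence([0.9, 0.1], [0.4, 0.6]))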
ICML 1994 (IEEE)
Markov Games as a Framework for Multi-Agent Reinforcement Learning
In the Markov decision process (MDP) formalization of reinforcement learning, a single adaptive agent interacts with an environment defined by a probabilistic transition function....
Michael L. Littman
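
As background for the MDP formalization the abstract starts from, here is a standard value-iteration sketch for a single-agent MDP; it is not the paper's Markov-game (minimax) algorithm, and the tiny transition and reward numbers are invented.

import numpy as np

def value_iteration(P, R, gamma=0.9, iters=200):
    # P[a][s, s'] = transition probability, R[s, a] = expected reward.
    n_states, n_actions = R.shape
    V = np.zeros(n_states)
    for _ in range(iters):
        Q = np.array([R[:, a] + gamma * P[a] @ V for a in range(n_actions)]).T
        V = Q.max(axis=1)
    return V, Q.argmax(axis=1)  # state values and a greedy policy

P = [np.array([[0.8, 0.2], [0.1, 0.9]]), np.array([[0.5, 0.5], [0.6, 0.4]])]
R = np.array([[1.0, 0.0], [0.0, 2.0]])
print(value_iteration(P, R))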
NIPS 2008
Optimization on a Budget: A Reinforcement Learning Approach
Many popular optimization algorithms, like the Levenberg-Marquardt algorithm (LMA), use heuristic-based "controllers" that modulate the behavior of the optimizer during ...
Paul Ruvolo, Ian R. Fasel, Javier R. Movellan
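
For reference, the heuristic "controller" the abstract alludes to in the Levenberg-Marquardt algorithm is the hand-tuned damping schedule sketched below; the paper proposes learning such a controller with reinforcement learning instead. The factor values and function name are conventional assumptions, not taken from the paper.

def update_damping(lam, cost_new, cost_old, up=10.0, down=0.1):
    # Classic LMA heuristic: shrink damping after a successful step (more
    # Gauss-Newton-like), grow it after a failed step (more gradient-descent-like).
    if cost_new < cost_old:
        return lam * down, True   # accept the step
    return lam * up, False        # reject the step, retry with more damping

print(update_damping(1.0, cost_new=0.5, cost_old=0.8))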