Search Sciweavers | Sciweavers

908 search results - page 77 / 182

Search results for: Stochastic Finite Learning

AAAI
2008

169 views | Intelligent Agents

Perpetual Learning for Non-Cooperative Multiple Agents

15 years 9 months ago

Download www.aaai.org

This paper examines, by argument, the dynamics of sequences of behavioural choices made, when non-cooperative restricted-memory agents learn in partially observable stochastic gam...

Luke Dickens

ICML
2009
IEEE

151 views | Machine Learning

Robot trajectory optimization using approximate inference

16 years 8 months ago

Download user.cs.tu-berlin.de

The general stochastic optimal control (SOC) problem in robotics scenarios is often too complex to be solved exactly and in near real time. A classical approximate solution is to ...

Marc Toussaint

ICML
2007
IEEE

197 views | Machine Learning

A permutation-augmented sampler for DP mixture models

16 years 8 months ago

Download www.machinelearning.org

We introduce a new inference algorithm for Dirichlet process mixture models. While Gibbs sampling and variational methods focus on local moves, the new algorithm makes more global...

Percy Liang, Michael I. Jordan, Benjamin Taskar

ICML
2008
IEEE

169 views | Machine Learning

Large scale manifold transduction

16 years 8 months ago

Download ronan.collobert.com

We show how the regularizer of Transductive Support Vector Machines (TSVM) can be trained by stochastic gradient descent for linear models and multi-layer architectures. The resul...

Michael Karlen, Jason Weston, Ayse Erkan, Ronan Co...

ICML
2010
IEEE

193 views | Machine Learning

Efficient Selection of Multiple Bandit Arms: Theory and Practice

15 years 8 months ago

Download www.cs.utexas.edu

We consider the general, widely applicable problem of selecting from n real-valued random variables a subset of size m of those with the highest means, based on as few samples as ...

Shivaram Kalyanakrishnan, Peter Stone
