Search Sciweavers | Sciweavers

50 search results - page 9 / 10

» Convergence and Divergence in Standard and Averaging Reinfor...

click to vote

SIGDIAL
2010

158views Natural Language Processing» more SIGDIAL 2010»

Sparse Approximate Dynamic Programming for Dialog Management

13 years 5 months ago

Download www.sigdial.org

Spoken dialogue management strategy optimization by means of Reinforcement Learning (RL) is now part of the state of the art. Yet, there is still a clear mismatch between the comp...

Senthilkumar Chandramohan, Matthieu Geist, Olivier...

claim paper

Read More »

click to vote

AUSAI
2005
Springer

166views Artificial Intelligence» more AUSAI 2005»

Adaptive Utility-Based Scheduling in Resource-Constrained Systems

14 years 1 months ago

Download labs.oracle.com

This paper addresses the problem of scheduling jobs in soft real-time systems, where the utility of completing each job decreases over time. We present a utility-based framework fo...

David Vengerov

claim paper

Read More »

click to vote

ECCC
2007

180views more ECCC 2007»

Adaptive Algorithms for Online Decision Problems

13 years 7 months ago

Download ftp.cs.princeton.edu

We study the notion of learning in an oblivious changing environment. Existing online learning algorithms which minimize regret are shown to converge to the average of all locally...

Elad Hazan, C. Seshadhri

claim paper

Read More »

click to vote

JAIR
2008

135views more JAIR 2008»

13 years 7 months ago

On Similarities between Inference in Game Theory and Machine Learning

Download www.maths.bris.ac.uk

In this paper, we elucidate the equivalence between inference in game theory and machine learning. Our aim in so doing is to establish an equivalent vocabulary between the two dom...

Iead Rezek, David S. Leslie, Steven Reece, Stephen...

claim paper

Read More »

click to vote

NIPS
2004

95views Information Technology» more NIPS 2004»

Newscast EM

13 years 9 months ago

Download books.nips.cc

We propose a gossip-based distributed algorithm for Gaussian mixture learning, Newscast EM. The algorithm operates on network topologies where each node observes a local quantity ...

Wojtek Kowalczyk, Nikos A. Vlassis

claim paper

Read More »

« Prev « First page 9 / 10 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers