Search Sciweavers | Sciweavers

34

ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

14 years 10 months ago

For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm--Delayed Q-Learning. We prove it is PAC, achieving near o...

Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...

claim paper

Read More »

26

click to vote

ICML
2005
IEEE

136views Machine Learning» more ICML 2005»

Learning as search optimization: approximate large margin methods for structured prediction

14 years 10 months ago

Download www.isi.edu

Mappings to structured output spaces (strings, trees, partitions, etc.) are typically learned using extensions of classification algorithms to simple graphical structures (eg., li...

Daniel Marcu, Hal Daumé III

claim paper

Read More »

47

click to vote

PERCOM
2004
ACM

154views Computer Networks» more PERCOM 2004»

Towards Scalable P2P Computing for Mobile Ad Hoc Networks

14 years 9 months ago

Download www.cl.cam.ac.uk

In mobile ad hoc networks, nodes interact peer-to-peer. They self-organize, share workloads and provide services that they also use. There are middleware platforms, designed for t...

Marco Conti, Enrico Gregori, Giovanni Turi

claim paper

Read More »

25

click to vote

ALT
2009
Springer

128views Machine Learning» more ALT 2009»

Pure Exploration in Multi-armed Bandits Problems

14 years 6 months ago

Download sequel.futurs.inria.fr

Abstract. We consider the framework of stochastic multi-armed bandit problems and study the possibilities and limitations of strategies that explore sequentially the arms. The stra...

Sébastien Bubeck, Rémi Munos, Gilles...

claim paper

Read More »

30

click to vote

IPPS
2009
IEEE

101views Distributed And Parallel Com...» more IPPS 2009»

Resource allocation strategies for constructive in-network stream processing

14 years 4 months ago

Download navet.ics.hawaii.edu

We consider the operator mapping problem for in-network stream processing, i.e., the application of a tree of operators in steady-state to multiple data objects that are continuou...

Anne Benoit, Henri Casanova, Veronika Rehn-Sonigo,...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers