Search Sciweavers | Sciweavers

526 search results - page 37 / 106

» Efficient Algorithms for Online Decision Problems

click to vote

ECML
2006
Springer

112views Machine Learning» more ECML 2006»

Bandit Based Monte-Carlo Planning

14 years 1 months ago

Download www.lri.fr

Abstract. For large state-space Markovian Decision Problems MonteCarlo planning is one of the few viable approaches to find near-optimal solutions. In this paper we introduce a new...

Levente Kocsis, Csaba Szepesvári

claim paper

Read More »

click to vote

ALGORITHMICA
2002

84views more ALGORITHMICA 2002»

On-Line Multi-Threaded Paging

13 years 10 months ago

Download www-2.dc.uba.ar

In this paper we introduce a generalization of Paging to the case where there are many threads of requests. This models situations in which the requests come from more than one ind...

Esteban Feuerstein, Alejandro Strejilevich de Loma

claim paper

Read More »

click to vote

CORR
2010
Springer

152views Education» more CORR 2010»

Neuroevolutionary optimization

13 years 10 months ago

Download jmlr.csail.mit.edu

Temporal difference methods are theoretically grounded and empirically effective methods for addressing reinforcement learning problems. In most real-world reinforcement learning ...

Eva Volná

claim paper

Read More »

click to vote

CCGRID
2009
IEEE

136views Distributed And Parallel Com...» more CCGRID 2009»

Efficient Grid Task-Bundle Allocation Using Bargaining Based Self-Adaptive Auction

13 years 11 months ago

Download www.s3lab.ece.ufl.edu

To address coordination and complexity issues, we formulate a grid task allocation problem as a bargaining based self-adaptive auction and propose the BarSAA grid task-bundle alloc...

Han Zhao, Xiaolin Li

claim paper

Read More »

click to vote

PKDD
2010
Springer

179views Data Mining» more PKDD 2010»

Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-Like Exploration

13 years 8 months ago

Download www.cs.utexas.edu

Abstract. We present an implementation of model-based online reinforcement learning (RL) for continuous domains with deterministic transitions that is specifically designed to achi...

Tobias Jung, Peter Stone

claim paper

Read More »

« Prev « First page 37 / 106 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers