Search Sciweavers | Sciweavers

152 search results - page 31 / 31

» A game-based abstraction-refinement framework for Markov dec...

170

Voted

ICML
2001
IEEE

185views Machine Learning» more ICML 2001»

Off-Policy Temporal Difference Learning with Function Approximation

16 years 6 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

claim paper

Read More »

167

click to vote

INFOCOM
2009
IEEE

153views Communications» more INFOCOM 2009»

Delay-Optimal Opportunistic Scheduling and Approximations: The Log Rule

16 years 18 days ago

Download users.ece.utexas.edu

—This paper considers the design of opportunistic packet schedulers for users sharing a time-varying wireless channel from the performance and the robustness points of view. Firs...

Bilal Sadiq, Seung Jun Baek, Gustavo de Veciana

claim paper

Read More »

« Prev « First page 31 / 31 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers