Search Sciweavers | Sciweavers

147 search results - page 9 / 30

» Policy Gradient in Continuous Time

171

click to vote

ML
2002
ACM

146views Machine Learning» more ML 2002»

Variable Resolution Discretization in Optimal Control

15 years 6 months ago

Download www.ri.cmu.edu

Abstract. The problemof state abstractionis of centralimportancein optimalcontrol,reinforcement learning and Markov decision processes. This paper studies the case of variable reso...

Rémi Munos, Andrew W. Moore

claim paper

Read More »

209

click to vote

ICML
1996
IEEE

196views Machine Learning» more ICML 1996»

A Convergent Reinforcement Learning Algorithm in the Continuous Case: The Finite-Element Reinforcement Learning

15 years 11 months ago

Download www.ri.cmu.edu

This paper presents a direct reinforcement learning algorithm, called Finite-Element Reinforcement Learning, in the continuous case, i.e. continuous state-space and time. The eval...

Rémi Munos

claim paper

Read More »

304

click to vote

TON
2010

151views more TON 2010»

Throughput Optimal Distributed Power Control of Stochastic Wireless Networks

15 years 1 months ago

Download pantheon.yale.edu

The Maximum Differential Backlog (MDB) control policy of Tassiulas and Ephremides has been shown to adaptively maximize the stable throughput of multihop wireless networks with ran...

Yufang Xi, Edmund M. Yeh

claim paper

Read More »

251

Voted

VIS
2008
IEEE

192views Visualization» more VIS 2008»

Smooth Surface Extraction from Unstructured Point-based Volume Data Using PDEs

16 years 8 months ago

Download www.math-inf.uni-greifswald.de

Abstract--Smooth surface extraction using partial differential equations (PDEs) is a well-known and widely used technique for visualizing volume data. Existing approaches operate o...

Paul Rosenthal, Lars Linsen

claim paper

Read More »

181

click to vote

TIT
2008

65views more TIT 2008»

Power-Efficient Resource Allocation for Time-Division Multiple Access Over Fading Channels

15 years 6 months ago

Download www.ee.fau.edu

We investigate resource allocation policies for time-division multiple access (TDMA) over fading channels in the power-limited regime. For frequency-flat block-fading channels and ...

Xin Wang, Georgios B. Giannakis

claim paper

Read More »

« Prev « First page 9 / 30 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers