Search Sciweavers | Sciweavers

802 search results - page 154 / 161

» Experts in a Markov Decision Process

ICML
2001
IEEE

Off-Policy Temporal Difference Learning with Function Approximation

16 years 8 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

ICML
1998
IEEE

Intra-Option Learning about Temporally Abstract Actions

16 years 8 months ago

Download www.cs.ualberta.ca

Intra-Option Learning about Temporally Abstract Actions. Richard S. Sutton, Department of Computer Science, University of Massachusetts, Amherst, MA 01003-4610, rich@cs.umass.edu. Doina Precup D...

Richard S. Sutton, Doina Precup, Satinder P. Singh

WWW
2005
ACM

Executing incoherency bounded continuous queries at web data aggregators

16 years 8 months ago

Download www.www2005.org

Continuous queries are used to monitor changes to time varying data and to provide results useful for online decision making. Typically a user desires to obtain the value of some ...

Rajeev Gupta, Ashish Puri, Krithi Ramamritham

INFOCOM
2009
IEEE

Delay-Optimal Opportunistic Scheduling and Approximations: The Log Rule

16 years 2 months ago

Download users.ece.utexas.edu

This paper considers the design of opportunistic packet schedulers for users sharing a time-varying wireless channel from the performance and the robustness points of view. Firs...

Bilal Sadiq, Seung Jun Baek, Gustavo de Veciana

CPAIOR
2009
Springer

Optimal Interdiction of Unreactive Markovian Evaders

16 years 2 months ago

Download math.lanl.gov

The interdiction problem arises in a variety of areas including military logistics, infectious disease control, and counter-terrorism. In the typical formulation of network interdi...

Alexander Gutfraind, Aric A. Hagberg, Feng Pan
