Search Sciweavers | Sciweavers

2167 search results - page 365 / 434

» Stochastic Process Algebras

124

Voted

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 4 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

140

Voted

NIPS
2001

192views Information Technology» more NIPS 2001»

Predictive Representations of State

15 years 4 months ago

Download www.eecs.umich.edu

We show that states of a dynamical system can be usefully represented by multi-step, action-conditional predictions of future observations. State representations that are grounded...

Michael L. Littman, Richard S. Sutton, Satinder P....

claim paper

Read More »

142

Voted

UAI
2004

195views Artificial Intelligence» more UAI 2004»

Solving Factored MDPs with Continuous and Discrete Variables

15 years 4 months ago

Download www.cs.pitt.edu

Although many real-world stochastic planning problems are more naturally formulated by hybrid models with both discrete and continuous variables, current state-of-the-art methods ...

Carlos Guestrin, Milos Hauskrecht, Branislav Kveto...

claim paper

Read More »

132

Voted

WSC
2001

103views Modeling And Simulation» more WSC 2001»

Quantile and histogram estimation

15 years 4 months ago

Download www.informs-sim.org

This paper discusses implementation of a sequential procedure to construct proportional half-width confidence intervals for a simulation estimator of the steady-state quantiles an...

E. Jack Chen, W. David Kelton

claim paper

Read More »

120

click to vote

WSC
2004

123views Modeling And Simulation» more WSC 2004»

A Near Optimal Approach to Quality of Service Data Replication Scheduling

15 years 4 months ago

Download www.informs-sim.org

This paper describes an approach to real-time decisionmaking for quality of service based scheduling of distributed asynchronous data replication. The proposed approach addresses ...

Kevin Adams, Denis Gracanin, Dusan Teodorovic

claim paper

Read More »

« Prev « First page 365 / 434 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers