Search Sciweavers | Sciweavers

377 search results - page 49 / 76

» Optimizing Production Manufacturing Using Reinforcement Lear...

168

NIPS
2004

92 views · Information Technology

Responding to Modalities with Different Latencies

15 years 8 months ago

Download books.nips.cc

Motor control depends on sensory feedback in multiple modalities with different latencies. In this paper we consider within the framework of reinforcement learning how different s...

Fredrik Bissmarck, Hiroyuki Nakahara, Kenji Doya, ...

144

WSC
1998

124 views · Modeling And Simulation

Adaptive Stochastic Manpower Scheduling

15 years 8 months ago

Download www.informs-sim.org

Bayesian forecasting models provide distributional estimates for random parameters, and relative to classical schemes, have the advantage that they can rapidly capture changes in ...

Elmira Popova, David P. Morton

156

EOR
2007

82 views

Minimizing makespan with multiple-orders-per-job in a two-machine flowshop

15 years 6 months ago

Download www.mistaconference.org

New semiconductor wafer fabrication facilities use Front Opening Unified Pods (FOUPs) as a common unit of wafer transfer. Since the number of pods is limited due to high costs, a...

Jeffrey D. Laub, John W. Fowler, Ahmet B. Keha

206

CORR
2010
Springer

100 views · Education

Products of Weighted Logic Programs

15 years 7 months ago

Download www.cs.cmu.edu

Abstract. Weighted logic programming, a generalization of bottom-up logic programming, is a successful framework for specifying dynamic programming algorithms. In this setting, pro...

Shay B. Cohen, Robert J. Simmons, Noah A. Smith

158

AAAI
2008

207 views · Intelligent Agents

Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation

15 years 9 months ago

Download sugiyama-www.cs.titech.ac.jp

Off-policy reinforcement learning is aimed at efficiently reusing data samples gathered in the past, which is an essential problem for physically grounded AI as experiments are us...

Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiya...
