Search Sciweavers | Sciweavers

147 search results - page 14 / 30

» Policy Gradient in Continuous Time


ISLPED
2005
ACM

122 views, Hardware

A simple mechanism to adapt leakage-control policies to temperature

15 years 8 months ago

Download pages.cs.wisc.edu

Leakage power reduction in cache memories continues to be a critical area of research because of the promise of a significant pay-off. Various techniques have been developed so fa...

Stefanos Kaxiras, Polychronis Xekalakis, Georgios ...



NIPS
2003

207 views, Information Technology

Extending Q-Learning to General Adaptive Multi-Agent Systems

15 years 4 months ago

Download books.nips.cc

Recent multi-agent extensions of Q-Learning require knowledge of other agents’ payoffs and Q-functions, and assume game-theoretic play at all times by all other agents. This pap...

Gerald Tesauro



WSC
2008

156 views, Modeling And Simulation

Supply chain risks analysis by using jump-diffusion model

15 years 5 months ago

Download www.informs-sim.org

This paper investigates the effects of demand risk on the performance of a supply chain in a continuous-time setting. The inventory level has been modeled as a jump-diffusion process ...

Xianzhe Chen, Jun Zhang



ESANN
2007

122 views, Neural Networks

The Recurrent Control Neural Network

15 years 4 months ago

Download www.dice.ucl.ac.be

This paper presents our Recurrent Control Neural Network (RCNN), which is a model-based approach for a data-efficient modelling and control of reinforcement learning problems in di...

Anton Maximilian Schäfer, Steffen Udluft, Han...



ICASSP
2011
IEEE

136 views, Signal Processing

SRF: Matrix completion based on smoothed rank function

14 years 7 months ago

Download perception.csl.uiuc.edu

In this paper, we address the matrix completion problem and propose a novel algorithm based on a smoothed rank function (SRF) approximation. Among available algorithms like FPCA a...

Hooshang Ghasemi, Mohmmadreza Malek-Mohammadi, Mas...

