Search Sciweavers | Sciweavers

115 search results - page 13 / 23

» Recurrent policy gradients

DATE 2008, IEEE

Thermal Balancing Policy for Streaming Computing on Multiprocessor Architectures

15 years 12 months ago

Download www.date-conference.com

As feature sizes decrease, power dissipation and heat generation density exponentially increase. Thus, temperature gradients in Multiprocessor Systems on Chip (MPSoCs) can serious...

Fabrizio Mulas, Michele Pittau, Marco Buttu, Salva...

ICANNGA 2007, Springer

Reinforcement Learning in Fine Time Discretization

15 years 11 months ago

Download staff.elka.pw.edu.pl

Reinforcement Learning (RL) is analyzed here as a tool for control system optimization. State and action spaces are assumed to be continuous. Time is assumed to be discrete, yet th...

Pawel Wawrzynski

NIPS 1998

Gradient Descent for General Reinforcement Learning

15 years 6 months ago

Download www.ri.cmu.edu

A simple learning rule is derived, the VAPS algorithm, which can be instantiated to generate a wide range of new reinforcement-learning algorithms. These algorithms solve a number ...

Leemon C. Baird III, Andrew W. Moore

CIS 2005, Springer

An RLS-Based Natural Actor-Critic Algorithm for Locomotion of a Two-Linked Robot Arm

15 years 11 months ago

Download www-clmc.usc.edu

Recently, actor-critic methods have drawn much interest in the area of reinforcement learning, and several algorithms have been studied along the line of the actor-critic strategy...

Jooyoung Park, Jongho Kim, Daesung Kang

QUESTA 2007

Stability of join-the-shortest-queue networks

15 years 4 months ago

Download www.me.utexas.edu

This paper investigates stability behavior in a variant of a generalized Jackson queueing network. In our network, some customers use a join-the-shortest-queue policy when enterin...

J. G. Dai, John J. Hasenbein, Bara Kim
