Search Sciweavers | Sciweavers

91 search results - page 11 / 19

» Parameter-exploring policy gradients

164

click to vote

DATE
2008
IEEE

99views Hardware» more DATE 2008»

Thermal Balancing Policy for Streaming Computing on Multiprocessor Architectures

16 years 20 days ago

Download www.date-conference.com

As feature sizes decrease, power dissipation and heat generation density exponentially increase. Thus, temperature gradients in Multiprocessor Systems on Chip (MPSoCs) can serious...

Fabrizio Mulas, Michele Pittau, Marco Buttu, Salva...

claim paper

Read More »

191

click to vote

JMLR
2010

227views more JMLR 2010»

PyBrain

15 years 4 months ago

Download www.idsia.ch

PyBrain is a versatile machine learning library for Python. Its goal is to provide ﬂexible, easyto-use yet still powerful algorithms for machine learning tasks, including a vari...

Tom Schaul, Justin Bayer, Daan Wierstra, Yi Sun, M...

claim paper

Read More »

162

click to vote

ICANNGA
2007
Springer

105views Algorithms» more ICANNGA 2007»

Reinforcement Learning in Fine Time Discretization

16 years 11 days ago

Download staff.elka.pw.edu.pl

Reinforcement Learning (RL) is analyzed here as a tool for control system optimization. State and action spaces are assumed to be continuous. Time is assumed to be discrete, yet th...

Pawel Wawrzynski

claim paper

Read More »

171

click to vote

NIPS
1998

140views Information Technology» more NIPS 1998»

Gradient Descent for General Reinforcement Learning

15 years 7 months ago

Download www.ri.cmu.edu

A simple learning rule is derived, the VAPS algorithm, which can be instantiated to generate a wide range of new reinforcementlearning algorithms. These algorithms solve a number ...

Leemon C. Baird III, Andrew W. Moore

claim paper

Read More »

176

click to vote

CIS
2005
Springer

129views Applied Computing» more CIS 2005»

An RLS-Based Natural Actor-Critic Algorithm for Locomotion of a Two-Linked Robot Arm

15 years 11 months ago

Download www-clmc.usc.edu

Recently, actor-critic methods have drawn much interests in the area of reinforcement learning, and several algorithms have been studied along the line of the actor-critic strategy...

Jooyoung Park, Jongho Kim, Daesung Kang

claim paper

Read More »

« Prev « First page 11 / 19 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers