Search Sciweavers | Sciweavers

170

ICML
2010
IEEE

231views Machine Learning» more ICML 2010»

Toward Off-Policy Learning Control with Function Approximation

15 years 7 months ago

We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...

Hamid Reza Maei, Csaba Szepesvári, Shalabh ...

claim paper

Read More »

197

Voted

CEC
2011
IEEE

221views Artificial Intelligence» more CEC 2011»

Stochastic Natural Gradient Descent by estimation of empirical covariances

14 years 6 months ago

Download chrome.ws.dei.polimi.it

—Stochastic relaxation aims at ﬁnding the minimum of a ﬁtness function by identifying a proper sequence of distributions, in a given model, that minimize the expected value o...

Luigi Malagò, Matteo Matteucci, Giovanni Pi...

claim paper

Read More »

162

click to vote

ESANN
2007

148views Neural Networks» more ESANN 2007»

Applying the Episodic Natural Actor-Critic Architecture to Motor Primitive Learning

15 years 7 months ago

Download www.dice.ucl.ac.be

In this paper, we investigate motor primitive learning with the Natural Actor-Critic approach. The Natural Actor-Critic consists out of actor updates which are achieved using natur...

Jan Peters, Stefan Schaal

claim paper

Read More »

175

click to vote

NCI
2004

185views Neural Networks» more NCI 2004»

Hierarchical reinforcement learning with subpolicies specializing for learned subgoals

15 years 7 months ago

Download staff.science.uva.nl

This paper describes a method for hierarchical reinforcement learning in which high-level policies automatically discover subgoals, and low-level policies learn to specialize for ...

Bram Bakker, Jürgen Schmidhuber

claim paper

Read More »

145

click to vote

PCI
2005
Springer

114views Information Technology» more PCI 2005»

TSIC: Thermal Scheduling Simulator for Chip Multiprocessors

15 years 11 months ago

Download www2.cs.ucy.ac.cy

Abstract. Increased power density, hot-spots, and temperature gradients are severe limiting factors for today’s state-of-the-art microprocessors. However, the ﬂexibility oﬀer...

Kyriakos Stavrou, Pedro Trancoso

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers