Search Sciweavers | Sciweavers

95 search results - page 12 / 19

» Policy Gradients for Cryptanalysis

201

click to vote

ICML
2010
IEEE

231views Machine Learning» more ICML 2010»

Toward Off-Policy Learning Control with Function Approximation

15 years 8 months ago

Download www.sztaki.hu

We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...

Hamid Reza Maei, Csaba Szepesvári, Shalabh ...

claim paper

Read More »

225

click to vote

CEC
2011
IEEE

221views Artificial Intelligence» more CEC 2011»

Stochastic Natural Gradient Descent by estimation of empirical covariances

14 years 6 months ago

Download chrome.ws.dei.polimi.it

—Stochastic relaxation aims at ﬁnding the minimum of a ﬁtness function by identifying a proper sequence of distributions, in a given model, that minimize the expected value o...

Luigi Malagò, Matteo Matteucci, Giovanni Pi...

claim paper

Read More »

186

click to vote

ESANN
2007

148views Neural Networks» more ESANN 2007»

Applying the Episodic Natural Actor-Critic Architecture to Motor Primitive Learning

15 years 8 months ago

Download www.dice.ucl.ac.be

In this paper, we investigate motor primitive learning with the Natural Actor-Critic approach. The Natural Actor-Critic consists out of actor updates which are achieved using natur...

Jan Peters, Stefan Schaal

claim paper

Read More »

157

click to vote

PCI
2005
Springer

114views Information Technology» more PCI 2005»

TSIC: Thermal Scheduling Simulator for Chip Multiprocessors

16 years 9 days ago

Download www2.cs.ucy.ac.cy

Abstract. Increased power density, hot-spots, and temperature gradients are severe limiting factors for today’s state-of-the-art microprocessors. However, the ﬂexibility oﬀer...

Kyriakos Stavrou, Pedro Trancoso

claim paper

Read More »

216

click to vote

AAAI
2010

199views Intelligent Agents» more AAAI 2010»

Bayesian Policy Search for Multi-Agent Role Discovery

15 years 8 months ago

Download web.engr.oregonstate.edu

Bayesian inference is an appealing approach for leveraging prior knowledge in reinforcement learning (RL). In this paper we describe an algorithm for discovering different classes...

Aaron Wilson, Alan Fern, Prasad Tadepalli

claim paper

Read More »

« Prev « First page 12 / 19 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers