Search Sciweavers | Sciweavers

4544 search results - page 199 / 909

» Reinforcement Learning with Time


JMLR
2010


Adaptive Step-size Policy Gradients with Average Reward Metric

14 years 9 months ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...


ECML
2004
Springer

Machine Learning

Adaptive Online Time Allocation to Search Algorithms

15 years 8 months ago

Download www.idsia.ch

Matteo Gagliolo, Viktor Zhumatiy, Jürgen Schm...


ECML
2004
Springer

Machine Learning

Constructive Induction for Classifying Time Series

15 years 8 months ago

Download etdncku.lib.ncku.edu.tw

Mohammed Waleed Kadous, Claude Sammut


ICML
1998
IEEE

Machine Learning

Intra-Option Learning about Temporally Abstract Actions

16 years 3 months ago

Download www.cs.ualberta.ca

tion Learning about Temporally Abstract Actions Richard S. Sutton Department of Computer Science University of Massachusetts Amherst, MA 01003-4610 rich@cs.umass.edu Doina Precup D...

Richard S. Sutton, Doina Precup, Satinder P. Singh


PAM
2011
Springer

Computer Networks

Peeling Away Timing Error in NetFlow Data

14 years 5 months ago

Download ftp.tik.ee.ethz.ch

In this paper, we characterize, quantify, and correct timing errors introduced into network flow data by collection and export via Cisco NetFlow version 9. We find that while som...

Brian Trammell, Bernhard Tellenbach, Dominik Schat...
