Sciweavers

Search results for "Reinforcement Learning with Time" (4544 results, page 199 of 909)
JMLR
2010
13 years 5 months ago
Adaptive Step-size Policy Gradients with Average Reward Metric
In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...
Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...
ECML
2004
Springer
14 years 3 months ago
Adaptive Online Time Allocation to Search Algorithms
Matteo Gagliolo, Viktor Zhumatiy, Jürgen Schm...
ECML
2004
Springer
14 years 3 months ago
Constructive Induction for Classifying Time Series
Mohammed Waleed Kadous, Claude Sammut
ICML
1998
IEEE
14 years 11 months ago
Intra-Option Learning about Temporally Abstract Actions
Richard S. Sutton, Department of Computer Science, University of Massachusetts, Amherst, MA 01003-4610, rich@cs.umass.edu; Doina Precup, D...
Richard S. Sutton, Doina Precup, Satinder P. Singh
PAM
2011
Springer
13 years 1 month ago
Peeling Away Timing Error in NetFlow Data
In this paper, we characterize, quantify, and correct timing errors introduced into network flow data by collection and export via Cisco NetFlow version 9. We find that while som...
Brian Trammell, Bernhard Tellenbach, Dominik Schat...