Sciweavers

Reinforcement Learning with Time
JMLR
2010
14 years 9 months ago
Adaptive Step-size Policy Gradients with Average Reward Metric
In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...
Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...
ECML
2004
Springer
15 years 8 months ago
Adaptive Online Time Allocation to Search Algorithms
Matteo Gagliolo, Viktor Zhumatiy, Jürgen Schm...
ECML
2004
Springer
15 years 8 months ago
Constructive Induction for Classifying Time Series
Mohammed Waleed Kadous, Claude Sammut
ICML
1998
IEEE
16 years 3 months ago
Intra-Option Learning about Temporally Abstract Actions
Richard S. Sutton, Department of Computer Science, University of Massachusetts, Amherst, MA 01003-4610, rich@cs.umass.edu; Doina Precup D...
Richard S. Sutton, Doina Precup, Satinder P. Singh
PAM
2011
Springer
14 years 5 months ago
Peeling Away Timing Error in NetFlow Data
In this paper, we characterize, quantify, and correct timing errors introduced into network flow data by collection and export via Cisco NetFlow version 9. We find that while som...
Brian Trammell, Bernhard Tellenbach, Dominik Schat...