Sciweavers

» Relating reinforcement learning performance to classificatio...
COLING
2010
14 years 11 months ago
Comparison of different algebras for inducing the temporal structure of texts
This paper investigates the impact of using different temporal algebras for learning temporal relations between events. Specifically, we compare three interval-based algebras: Alle...
Pascal Denis, Philippe Muller
ICML
2009
IEEE
16 years 5 months ago
Constraint relaxation in approximate linear programs
Approximate Linear Programming (ALP) is a reinforcement learning technique with nice theoretical properties, but it often performs poorly in practice. We identify some reasons for...
Marek Petrik, Shlomo Zilberstein
ICML
2010
IEEE
15 years 5 months ago
Internal Rewards Mitigate Agent Boundedness
Reinforcement learning (RL) research typically develops algorithms for helping an RL agent best achieve its goals (however they came to be defined) while ignoring the rela...
Jonathan Sorg, Satinder P. Singh, Richard Lewis
JMLR
2010
14 years 11 months ago
Adaptive Step-size Policy Gradients with Average Reward Metric
In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...
Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...
ICCS
2007
Springer
15 years 10 months ago
Text Classification with Support Vector Machine and Back Propagation Neural Network
We compared a support vector machine (SVM) with a back propagation neural network (BPNN) for the task of text classification of XiangShan science conference (XSSC) web do...
Wen Zhang, Xijin Tang, Taketoshi Yoshida