Sciweavers

358 search results - page 33 / 72
» Online Testing with Reinforcement Learning
Sort
View
JMLR
2010
189views more  JMLR 2010»
13 years 4 months ago
Adaptive Step-size Policy Gradients with Average Reward Metric
In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...
Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...
LREC
2010
217views Education» more  LREC 2010»
13 years 11 months ago
The Dictionary of Italian Collocations: Design and Integration in an Online Learning Environment
In this paper, I introduce the DICI, an electronic dictionary of Italian collocations designed to support the acquisition of the collocational competence in learners of Italian as...
Stefania Spina
INTERSPEECH
2010
13 years 4 months ago
Online adaptive learning for speech recognition decoding
We describe a new method for pruning in dynamic models based on running an adaptive filtering algorithm online during decoding to predict aspects of the scores in the near future....
Jeff Bilmes, Hui Lin
SIGCSE
2004
ACM
112views Education» more  SIGCSE 2004»
14 years 3 months ago
Using software testing to move students from trial-and-error to reflection-in-action
Introductory computer science students rely on a trial and error approach to fixing errors and debugging for too long. Moving to a reflection in action strategy can help students ...
Stephen H. Edwards
ITC
2003
IEEE
114views Hardware» more  ITC 2003»
14 years 3 months ago
Test-Based Model Generation For Legacy Systems
We study the extension of applicability of system-level testing techniques to the construction of a consistent model of (legacy) systems under test, which are seen as black boxes....
Hardi Hungar, Tiziana Margaria, Bernhard Steffen