Sciweavers

3395 search results - page 465 / 679
» Learning to efficiently rank
Sort
View
ALT
2010
Springer
15 years 7 months ago
Optimal Online Prediction in Adversarial Environments
: In many prediction problems, including those that arise in computer security and computational finance, the process generating the data is best modeled as an adversary with whom ...
Peter L. Bartlett
UAI
2008
15 years 7 months ago
CORL: A Continuous-state Offset-dynamics Reinforcement Learner
Continuous state spaces and stochastic, switching dynamics characterize a number of rich, realworld domains, such as robot navigation across varying terrain. We describe a reinfor...
Emma Brunskill, Bethany R. Leffler, Lihong Li, Mic...
VVEIS
2008
15 years 7 months ago
The Linear Conditional Probability Matrix Generator for IT Governance Performance Prediction
The goal of IT governance is not only to achieve internal efficiency in an IT organization, but also to support IT's role as a business enabler. The latter is here denoted IT ...
Mårten Simonsson, Robert Lagerström, Po...
ESANN
2004
15 years 7 months ago
High-accuracy value-function approximation with neural networks applied to the acrobot
Several reinforcement-learning techniques have already been applied to the Acrobot control problem, using linear function approximators to estimate the value function. In this pape...
Rémi Coulom
FLAIRS
2006
15 years 7 months ago
Managing Student Emotions in Intelligent Tutoring Systems
1 In the classic educational context, observing and identifying learner's emotional response allow the teacher to adapt the lesson, with the aim of improving the quality of th...
Roger Nkambou