Sciweavers

699 search results - page 10 / 140
» Online Dynamic Value System for Machine Learning
NIPS
1996
13 years 8 months ago
Multidimensional Triangulation and Interpolation for Reinforcement Learning
Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...
Scott Davies
ICML
2009
IEEE
14 years 8 months ago
Interactively optimizing information retrieval systems as a dueling bandits problem
We present an on-line learning framework tailored towards real-time learning from observed user behavior in search engines and other information retrieval systems. In particular, ...
Yisong Yue, Thorsten Joachims
ICALT
2005
IEEE
14 years 29 days ago
MALESAbrain for Problem-Based Learning in IT Education
This paper reports MALESAbrain, an intelligent online tool for problem-based learning (PBL) in IT education. The learning model of MALESAbrain is built on the notions of threshold ...
Akcell Chiang, Mohd Sapiyan Baba
ECML
2007
Springer
14 years 1 month ago
Discriminative Sequence Labeling by Z-Score Optimization
We consider a new discriminative learning approach to sequence labeling based on the statistical concept of the Z-score. Given a training set of pairs of hidden-observed ...
Elisa Ricci, Tijl De Bie, Nello Cristianini
ICML
2009
IEEE
14 years 8 months ago
Analytic moment-based Gaussian process filtering
We propose an analytic moment-based filter for nonlinear stochastic dynamic systems modeled by Gaussian processes. Exact expressions for the expected value and the covariance matr...
Marc Peter Deisenroth, Marco F. Huber, Uwe D. Hane...