Sciweavers

4544 search results - page 795 / 909
» Reinforcement Learning with Time
LWA
2007
15 years 4 months ago
Taking the Teacher's Perspective for User Modeling in Complex Domains
Serious games that should adapt training to the individual might benefit from methods that are developed for intelligent tutoring systems. One method, model tracing, might be used...
Christian P. Janssen, Hedderik van Rijn
SDM
2008
SIAM
15 years 4 months ago
Direct Density Ratio Estimation for Large-scale Covariate Shift Adaptation
Covariate shift is a situation in supervised learning where training and test inputs follow different distributions even though the functional relation remains unchanged. A common...
Yuta Tsuboi, Hisashi Kashima, Shohei Hido, Steffen...
MEDINFO
2007
15 years 4 months ago
Confidentiality Preserving Audits of Electronic Medical Record Access
Failure to supply a care provider with timely access to a patient's medical record can lead to patient harm or death. As such, healthcare organizations often endow care provi...
Bradley Malin, Edoardo Airoldi
NIPS
2007
15 years 4 months ago
Predicting Brain States from fMRI Data: Incremental Functional Principal Component Regression
We propose a method for reconstruction of human brain states directly from functional neuroimaging data. The method extends the traditional multivariate regression analysis of dis...
Sennay Ghebreab, Arnold W. M. Smeulders, Pieter W....
NIPS
2007
15 years 4 months ago
Optimistic Linear Programming gives Logarithmic Regret for Irreducible MDPs
We present an algorithm called Optimistic Linear Programming (OLP) for learning to optimize average reward in an irreducible but otherwise unknown Markov decision process (MDP). O...
Ambuj Tewari, Peter L. Bartlett