Sciweavers

Search results for: Reinforcement Learning with Time
LWA
2007
Taking the Teacher's Perspective for User Modeling in Complex Domains
Serious games that should adapt training to the individual might benefit from methods that are developed for intelligent tutoring systems. One method, model tracing, might be used...
Christian P. Janssen, Hedderik van Rijn
SDM
2008
SIAM
Direct Density Ratio Estimation for Large-scale Covariate Shift Adaptation
Covariate shift is a situation in supervised learning where training and test inputs follow different distributions even though the functional relation remains unchanged. A common...
Yuta Tsuboi, Hisashi Kashima, Shohei Hido, Steffen...
MEDINFO
2007
Confidentiality Preserving Audits of Electronic Medical Record Access
Failure to supply a care provider with timely access to a patient's medical record can lead to patient harm or death. As such, healthcare organizations often endow care provi...
Bradley Malin, Edoardo Airoldi
NIPS
2007
Predicting Brain States from fMRI Data: Incremental Functional Principal Component Regression
We propose a method for reconstruction of human brain states directly from functional neuroimaging data. The method extends the traditional multivariate regression analysis of dis...
Sennay Ghebreab, Arnold W. M. Smeulders, Pieter W....
NIPS
2007
Optimistic Linear Programming gives Logarithmic Regret for Irreducible MDPs
We present an algorithm called Optimistic Linear Programming (OLP) for learning to optimize average reward in an irreducible but otherwise unknown Markov decision process (MDP). O...
Ambuj Tewari, Peter L. Bartlett