Sciweavers

358 search results - page 30 / 72
» Online Testing with Reinforcement Learning
ICML
2001
IEEE
14 years 10 months ago
Expectation Maximization for Weakly Labeled Data
We call data weakly labeled if it has no exact label but rather a numerical indication of correctness of the label "guessed" by the learning algorithm - a situation comm...
Yuri A. Ivanov, Bruce Blumberg, Alex Pentland
ALENEX
2008
13 years 11 months ago
Comparing Online Learning Algorithms to Stochastic Approaches for the Multi-Period Newsvendor Problem
The multi-period newsvendor problem describes the dilemma of a newspaper salesman: how many papers should he purchase each day to resell when he doesn't know the demand? We d...
Shawn O'Neil, Amitabh Chaudhary
HICSS
2006
IEEE
14 years 4 months ago
A Case Study of a Longstanding Online Community of Practice Involving Critical Care and Advanced Practice Nurses
The aims of this study are: (1) to examine to what extent critical care and advanced practice nurses’ participation in an online listserv constituted a community of practice, an...
Noriko Hara, Khe Foon Hew
ROMAN
2007
IEEE
14 years 4 months ago
Online Affect Detection and Adaptation in Robot Assisted Rehabilitation for Children with Autism
This paper presents a novel affect-sensitive human-robot interaction framework for rehabilitation of children with autism spectrum disorder (ASD) where the robot can detect the ...
Changchun Liu, Karla Conn, Nilanjan Sarkar, Wendy ...
GECCO
2011
Springer
13 years 1 months ago
Online, GA based mixture of experts: a probabilistic model of UCS
In recent years there have been efforts to develop a probabilistic framework to explain the workings of a Learning Classifier System. This direction of research has met with lim...
Narayanan Unny Edakunni, Gavin Brown, Tim Kovacs