Sciweavers

ISMDA
2003
Springer

Data Imbalance in Surveillance of Nosocomial Infections

14 years 4 months ago
Data Imbalance in Surveillance of Nosocomial Infections
Abstract. An important problem that arises in hospitals is the monitoring and detection of nosocomial or hospital acquired infections (NIs). This paper describes a retrospective analysis of a prevalence survey of NIs done in the Geneva University Hospital. Our goal is to identify patients with one or more NIs on the basis of clinical and other data collected during the survey. In this classification task, the main difficulty resides in the significant imbalance between positive or infected (11%) and negative (89%) cases. To remedy class imbalance, we propose a novel approach in which both oversampling of rare positives and undersampling of the non infected majority rely on synthetic cases generated via class-specific subclustering. Experiments have shown this approach to be remarkably more effective than classical random resampling methods.
Gilles Cohen, Melanie Hilario, Hugo Sax, Sté
Added 07 Jul 2010
Updated 07 Jul 2010
Type Conference
Year 2003
Where ISMDA
Authors Gilles Cohen, Melanie Hilario, Hugo Sax, Stéphane Hugonnet
Comments (0)