Sciweavers

CIDU
2010

Analyzing Aviation Safety Reports: From Topic Modeling to Scalable Multi-Label Classification

13 years 9 months ago
Analyzing Aviation Safety Reports: From Topic Modeling to Scalable Multi-Label Classification
Abstract. The Aviation Safety Reporting System (ASRS) is used to collect voluntarily submitted aviation safety reports from pilots, controllers and others. As such it is particularly useful in researching aviation safety deficiencies. In this paper we address two challenges related to the analysis of ASRS data: (1) the unsupervised extraction of meaningful and interpretable topics from ASRS reports and (2) multi-label classification of ASRS data based on a set of predefined categories. For topic modeling we investigate the practical usefulness of Latent Dirichlet Allocation (LDA) when it comes to modeling ASRS reports in terms of interpretable topics. We also utilize LDA to generate a more compact representation of ASRS reports to be used in multi-label classification. For multi-label classification we propose a novel and highly scalable multi-label classification algorithm based on multi-variate regression. Empirical results indicate that our approach is superior to several baseline a...
Amrudin Agovic, Hanhuai Shan, Arindam Banerjee
Added 10 Feb 2011
Updated 10 Feb 2011
Type Journal
Year 2010
Where CIDU
Authors Amrudin Agovic, Hanhuai Shan, Arindam Banerjee
Comments (0)