Sciweavers

SIGSOFT
2010
ACM

Software is data too

13 years 9 months ago
Software is data too
Software systems are designed and engineered to process data. However, software is data too. The size and variety of today's software artifacts and the multitude of stakeholder activities result in so much data that individuals can no longer reason about all of it. We argue in this position paper that data mining, statistical analysis, machine learning, information retrieval, data integration, etc., are necessary solutions to deal with software data. New research is needed to adapt existing algorithms and tools for software engineering data and processes, and new ones will have to be created. In order for this type of research to succeed, it should be supported with new approaches to empirical work, where data and results are shared globally among researchers and practitioners. Software engineering researchers can get inspired by other fields, such as, bioinformatics, where results of mining and analyzing biological data are often stored in databases shared across the world. Cate...
Andrian Marcus, Tim Menzies
Added 15 Feb 2011
Updated 15 Feb 2011
Type Journal
Year 2010
Where SIGSOFT
Authors Andrian Marcus, Tim Menzies
Comments (0)