Sciweavers

MSR
2009
ACM

Automatic labeling of software components and their evolution using log-likelihood ratio of word frequencies in source code

14 years 5 months ago
Automatic labeling of software components and their evolution using log-likelihood ratio of word frequencies in source code
As more and more open-source software components become available on the internet we need automatic ways to label and compare them. For example, a developer who searches for reusable software must be able to quickly gain an understanding of retrieved components. This understanding cannot be gained at the level of source code due to the semantic gap between source code and the domain model. In this paper we present a lexical approach that uses the log-likelihood ratios of word frequencies to automatically provide labels for software components. We present a prototype implementation of our labeling/comparison algorithm and provide examples of its application. In particular, we apply the approach to detect trends in the evolution of a software system.
Adrian Kuhn
Added 23 Jul 2010
Updated 23 Jul 2010
Type Conference
Year 2009
Where MSR
Authors Adrian Kuhn
Comments (0)