Automatic labeling of software components and their evolution using log-likelihood ratio of word frequencies in source code

15 years 11 months ago

Download scg.unibe.ch

As more and more open-source software components become available on the internet we need automatic ways to label and compare them. For example, a developer who searches for reusable software must be able to quickly gain an understanding of retrieved components. This understanding cannot be gained at the level of source code due to the semantic gap between source code and the domain model. In this paper we present a lexical approach that uses the log-likelihood ratios of word frequencies to automatically provide labels for software components. We present a prototype implementation of our labeling/comparison algorithm and provide examples of its application. In particular, we apply the approach to detect trends in the evolution of a software system.

Adrian Kuhn

Real-time Traffic

MSR 2009 | Open-source Software Components | Software Components | Source Code |

claim paper

Post Info
More Details (n/a)

Added	23 Jul 2010
Updated	23 Jul 2010
Type	Conference
Year	2009
Where	MSR
Authors	Adrian Kuhn

Comments (0)

Sciweavers

Automatic labeling of software components and their evolution using log-likelihood ratio of word frequencies in source code

MSR 2009 | Open-source Software Components | Software Components | Source Code |

Explore & Download

Productivity Tools

Sciweavers