Automatic labeling of software components and their evolution using log-likelihood ratio of word frequencies in source code