Sciweavers

MSR
2015
ACM

Summarizing Complex Development Artifacts by Mining Heterogeneous Data

8 years 7 months ago
Summarizing Complex Development Artifacts by Mining Heterogeneous Data
—Summarization is hailed as a promising approach to reduce the amount of information that must be taken in by the person who wants to understand development artifacts, such as pieces of code, bug reports, emails, etc. However, existing approaches treat artifacts as pure textual entities, disregarding the heterogeneous and partially structured nature of most artifacts, which contain intertwined pieces of distinct type, such as source code, diffs, stack traces, human language, etc. We present a novel approach to augment existing summarization techniques (such as LexRank) to deal with the heterogeneous and multidimensional nature of complex artifacts. Our preliminary results on heterogeneous artifacts suggest our approach outperforms the current text-based approaches.
Added 15 Apr 2016
Updated 15 Apr 2016
Type Journal
Year 2015
Where MSR
Comments (0)