Sciweavers

ACL
2015

Summarization of Multi-Document Topic Hierarchies using Submodular Mixtures

8 years 7 months ago
Summarization of Multi-Document Topic Hierarchies using Submodular Mixtures
We study the problem of summarizing DAG-structured topic hierarchies over a given set of documents. Example applications include automatically generating Wikipedia disambiguation pages for a set of articles, and generating candidate multi-labels for preparing machine learning datasets (e.g., for text classification, functional genomics, and image classification). Unlike previous work, which focuses on clustering the set of documents using the topic hierarchy as features, we directly pose the problem as a submodular optimization problem on a topic hierarchy using the documents as features. Desirable properties of the chosen topics include document coverage, specificity, topic diversity, and topic homogeneity, each of which, we show, is naturally modeled by a submodular function. Other information, provided say by unsupervised approaches such as LDA and its variants, can also be utilized by defining a submodular function that expresses coherence between the chosen topics and this in...
Ramakrishna Bairi, Rishabh K. Iyer, Ganesh Ramakri
Added 13 Apr 2016
Updated 13 Apr 2016
Type Journal
Year 2015
Where ACL
Authors Ramakrishna Bairi, Rishabh K. Iyer, Ganesh Ramakrishnan, Jeff A. Bilmes
Comments (0)