

Seeded Discovery of Base Relations in Large Corpora

Relationship discovery is the task of identifying salient relationships between named entities in text. We propose novel approaches for two sub-tasks of the problem: identifying the entities of interest, and partitioning and describing the relations based on their semantics. In particular, we show that term frequency patterns can be used effectively in place of supervised named-entity recognition (NER), and that the p-median clustering objective function naturally uncovers relation exemplars appropriate for describing the partitioning. Furthermore, we introduce a novel application of relationship discovery: the unsupervised identification of protein-protein interaction phrases.
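
As a rough illustration of why a p-median objective yields exemplars, the sketch below picks p items from a set so that the total distance from every item to its nearest chosen item is minimal; the chosen items then serve as cluster representatives. This is not the authors' implementation: the brute-force search, the function names, and the toy one-dimensional data are all assumptions made for illustration, and a corpus-scale system would need a heuristic or approximate solver.

```python
from itertools import combinations

def p_median_cost(dist, medians):
    # Total distance from each item to its closest chosen median (exemplar).
    return sum(min(dist[i][m] for m in medians) for i in range(len(dist)))

def solve_p_median(dist, p):
    # Exhaustive search over all size-p subsets; only feasible for tiny inputs.
    n = len(dist)
    best = min(combinations(range(n), p),
               key=lambda ms: p_median_cost(dist, ms))
    # Assign every item to its nearest exemplar to obtain the partition.
    assignment = [min(best, key=lambda m: dist[i][m]) for i in range(n)]
    return list(best), assignment

# Hypothetical toy data: six points on a line forming two obvious groups.
points = [0.0, 1.0, 2.0, 10.0, 11.0, 12.0]
dist = [[abs(a - b) for b in points] for a in points]

exemplars, assignment = solve_p_median(dist, 2)
print(exemplars)   # indices of the chosen exemplars
print(assignment)  # exemplar index assigned to each point
```

Because each median must itself be a member of the data set, every cluster comes with a concrete representative, which is the property the abstract relies on when it says the objective uncovers exemplars suitable for describing the partitioning.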
Nicholas Andrews, Naren Ramakrishnan
Added 29 Oct 2010
Updated 29 Oct 2010
Type Conference
Year 2008