Sciweavers

IRCDL
2010

A New Domain Independent Keyphrase Extraction System

13 years 11 months ago
A New Domain Independent Keyphrase Extraction System
In this paper we present a keyphrase extraction system that can extract potential phrases from a single document in an unsupervised, domain-independent way. We extract word n-grams from input document. We incorporate linguistic knowledge (i.e., part-of-speech tags), and statistical information (i.e., frequency, position, lifespan) of each n-gram in defining candidate phrases and their respective feature sets. The proposed approach can be applied to any document, however, in order to know the effectiveness of the system for digital libraries, we have carried out the evaluation on a set of scientific documents, and compared our results with current keyphrase extraction systems.
Nirmala Pudota, Antonina Dattolo, Andrea Baruzzo,
Added 28 Jan 2011
Updated 28 Jan 2011
Type Journal
Year 2010
Where IRCDL
Authors Nirmala Pudota, Antonina Dattolo, Andrea Baruzzo, Carlo Tasso
Comments (0)