A New Domain Independent Keyphrase Extraction System

15 years 6 months ago

Download users.dimi.uniud.it

In this paper we present a keyphrase extraction system that can extract potential phrases from a single document in an unsupervised, domain-independent way. We extract word n-grams from input document. We incorporate linguistic knowledge (i.e., part-of-speech tags), and statistical information (i.e., frequency, position, lifespan) of each n-gram in deﬁning candidate phrases and their respective feature sets. The proposed approach can be applied to any document, however, in order to know the eﬀectiveness of the system for digital libraries, we have carried out the evaluation on a set of scientiﬁc documents, and compared our results with current keyphrase extraction systems.

Nirmala Pudota, Antonina Dattolo, Andrea Baruzzo,

Real-time Traffic

Current Keyphrase Extraction | Digital Library | Documents | IRCDL 2010 | Keyphrase Extraction |

claim paper

Post Info
More Details (n/a)

Added	28 Jan 2011
Updated	28 Jan 2011
Type	Journal
Year	2010
Where	IRCDL
Authors	Nirmala Pudota, Antonina Dattolo, Andrea Baruzzo, Carlo Tasso

Comments (0)

Sciweavers

A New Domain Independent Keyphrase Extraction System

Current Keyphrase Extraction | Digital Library | Documents | IRCDL 2010 | Keyphrase Extraction |

Explore & Download

Productivity Tools

Sciweavers