We propose a new method that enhances automatic keyphrase extraction by using semantic information on terms and phrases gleaned from a domain-specific thesaurus. We evaluate the results against keyphrase sets assigned by a state-of-the-art keyphrase extraction system and those assigned by six professional indexers.

Categories and Subject Descriptors
H.3.1 [Content Analysis and Indexing]: Indexing methods, linguistic processing, thesauruses.

General Terms
Algorithms, Performance, Reliability, Experimentation.

Keywords
Automatic indexing, machine aided indexing, keyphrase extraction, keyphrase assignment.

Olena Medelyan, Ian H. Witten