Unsupervised Phrasal Near-Synonym Generation from Text Corpora

10 years 4 months ago

Download www.cs.cmu.edu

Unsupervised discovery of synonymous phrases is useful in a variety of tasks ranging from text mining and search engines to semantic analysis and machine translation. This paper presents an unsupervised corpus-based conditional model: Near-Synonym System (NeSS) for finding phrasal synonyms and near synonyms that requires only a large monolingual corpus. The method is based on maximizing information-theoretic combinations of shared contexts and is parallelizable for large-scale processing. An evaluation framework with crowd-sourced judgments is proposed and results are compared with alternate methods, demonstrating considerably superior results to the literature and to thesaurus look up for multi-word phrases. Moreover, the results show that the statistical scoring functions and overall scalability of the system are more important than language specific NLP tools. The method is language-independent and practically useable due to accuracy and real-time performance via parallel decomposi...

Dishan Gupta, Jaime G. Carbonell, Anatole Gershman

Real-time Traffic

AAAI 2015 | Intelligent Agents |

claim paper

Post Info
More Details (n/a)

Added	27 Mar 2016
Updated	27 Mar 2016
Type	Journal
Year	2015
Where	AAAI
Authors	Dishan Gupta, Jaime G. Carbonell, Anatole Gershman, Steve Klein, David Miller

Comments (0)

Sciweavers

Unsupervised Phrasal Near-Synonym Generation from Text Corpora

AAAI 2015 | Intelligent Agents |

Explore & Download

Productivity Tools

Sciweavers