The paper presents a new approach to the problem of paraphrase identification. The new approach extends a previously proposed method for the task of textual entailment. The relationship between paraphrases and entailment is discussed to theoretically justify the new approach. The proposed approach is useful because it uses relatively few resources compared to similar systems yet it produces results similar or better than other approaches to paraphrase identification. The approach also offers significantly better results than two baselines. We report results on a standard data set as well as on a new, balanced data set.
Vasile Rus, Philip M. McCarthy, Mihai C. Lintean,