Sciweavers

EMNLP
2004

Automatic Paragraph Identification: A Study across Languages and Domains

14 years 2 months ago
Automatic Paragraph Identification: A Study across Languages and Domains
In this paper we investigate whether paragraphs can be identified automatically in different languages and domains. We propose a machine learning approach which exploits textual and discourse cues and we assess how well humans perform on this task. Our best models achieve an accuracy that is significantly higher than the best baseline and, for most data sets, comes to within 6% of human performance.
Caroline Sporleder, Mirella Lapata
Added 30 Oct 2010
Updated 30 Oct 2010
Type Conference
Year 2004
Where EMNLP
Authors Caroline Sporleder, Mirella Lapata
Comments (0)