CEAS
2006
Springer

Annotating Subsets of the Enron Email Corpus

We present an annotation project for two subsets of the Enron email corpus. The first is a subset of the UC Berkeley Enron Email Analysis Project, and the second consists of a portion of emails from the Voice Transcripts Email Correlated Corpora. Parts of the Automatic Content Extraction (ACE) annotation guidelines, extended for the email domain, are used for annotation. We also categorize the emails by email speech act, mark whether the text contains discussions of meetings or conversations, and determine the degree of correlation between the subject line and the text body.
Type: Conference
Year: 2006
Where: CEAS
Authors: Jade Goldstein, Andres Kwasinksi, Paul Kingsbury, Roberta Evans Sabin, Albert McDowell