Annotating Subsets of the Enron Email Corpus

15 years 10 months ago

Download www.ceas.cc

We present an annotation project for two subsets of the Enron email corpus. The first is a subset of the UC Berkeley Enron Email Analysis Project and the second consists of a portion of emails from the Voice Transcripts Email Correlated Corpora. Parts of the automatic content extraction (ACE) annotation guidelines, extended for the email domain are used for annotation. We also categorize the emails with email speech acts, mark whether the text contains discussions of meetings/conversations, and determine the degree of correlation of the subject line with the text body.

Jade Goldstein, Andres Kwasinksi, Paul Kingsbury,

Real-time Traffic

CEAS 2006 | Email | Enron Email | Enron Email Corpus | Internet Technology |

claim paper

» Avatar semantic search a database approach to information retrieval

Post Info
More Details (n/a)

Added	20 Aug 2010
Updated	20 Aug 2010
Type	Conference
Year	2006
Where	CEAS
Authors	Jade Goldstein, Andres Kwasinksi, Paul Kingsbury, Roberta Evans Sabin, Albert McDowell

Comments (0)

Sciweavers

Annotating Subsets of the Enron Email Corpus

CEAS 2006 | Email | Enron Email | Enron Email Corpus | Internet Technology |

Explore & Download

Productivity Tools

Sciweavers