The Manually Annotated Sub-Corpus: A Community Resource for and by the People

15 years 4 months ago

Download www.cs.vassar.edu

The Manually Annotated Sub-Corpus (MASC) project provides data and annotations to serve as the base for a communitywide annotation effort of a subset of the American National Corpus. The MASC infrastructure enables the incorporation of contributed annotations into a single, usable format that can then be analyzed as it is or ported to any of a variety of other formats. MASC includes data from a much wider variety of genres than existing multiply-annotated corpora of English, and the project is committed to a fully open model of distribution, without restriction, for all data and annotations produced or contributed. As such, MASC is the first large-scale, open, communitybased effort to create much needed language resources for NLP. This paper describes the MASC project, its corpus and annotations, and serves as a call for contributions of data and annotations from the language processing community.

Nancy Ide, Collin F. Baker, Christiane Fellbaum, R

Real-time Traffic

ACL 2010 | Annotations | Computational Linguistics | MASC | MASC Infrastructure |

claim paper

» New challenges for text mining mapping between text and manually curated pathways

» Uncertainty Corpus Resource to Study User Affect in Complex Spoken Dialogue Systems

» Automatic Adaptation of Annotation Standards Chinese Word Segmentation and POS Tagging A ...

» Exploring social annotations for the semantic web

» A humanmachine collaborative approach to tracking human movement in multicamera video

» The Universal Protein Resource UniProt

» Provenance and evidence in UniProtKB

» Syntactic Annotation Guidelines for the Quranic Arabic Dependency Treebank

Post Info
More Details (n/a)

Added	10 Feb 2011
Updated	10 Feb 2011
Type	Journal
Year	2010
Where	ACL
Authors	Nancy Ide, Collin F. Baker, Christiane Fellbaum, Rebecca J. Passonneau

Comments (0)

Sciweavers

The Manually Annotated Sub-Corpus: A Community Resource for and by the People

ACL 2010 | Annotations | Computational Linguistics | MASC | MASC Infrastructure |

Explore & Download

Productivity Tools

Sciweavers