Sciweavers

LRE
2006

Efficient corpus development for lexicography: building the New Corpus for Ireland

In a 12-month project we have developed a new, register-diverse, 55-million-word bilingual corpus, the New Corpus for Ireland (NCI), to support the creation of a new English-to-Irish dictionary. The paper describes the strategies we employed, and the solutions to problems encountered. We believe we have a good model for corpus creation for lexicography, and others may find it useful as a blueprint. The corpus has two parts, one Irish, the other Hiberno-English (English as spoken in Ireland). We describe its design, collection and encoding.
Keywords Corpus linguistics
Added 14 Dec 2010
Updated 14 Dec 2010
Type Journal
Year 2006
Where LRE
Authors Adam Kilgarriff, Michael Rundell, Elaine Uí Dhonnchadha