In a 12-month project we developed a new, register-diverse, 55-million-word bilingual corpus, the New Corpus for Ireland (NCI), to support the creation of a new English-to-Irish dictionary. The corpus has two parts: one Irish, the other Hiberno-English (English as spoken in Ireland). This paper describes its design, collection and encoding, the strategies we employed, and our solutions to the problems we encountered. We believe the approach is a good model for corpus creation for lexicography, and that others may find it useful as a blueprint.

Keywords: Corpus linguistics