We implemented a web server for acronym and abbreviation lookup, containing a collection of acronyms and their expansions gathered from a large number of web pages by a heuristic extraction process. Several different extraction algorithms were evaluated and compared. The corpus resulting from the best algorithm is comparable to a highquality hand-crafted site, but has the potential to be much more inclusive as data from more web pages are processed.
Leah S. Larkey, Paul Ogilvie, M. Andrew Price, Bre