Sciweavers

ACL
2003

Automatic Acquisition of Named Entity Tagged Corpus from World Wide Web

14 years 25 days ago
Automatic Acquisition of Named Entity Tagged Corpus from World Wide Web
In this paper, we present a method that automatically constructs a Named Entity (NE) tagged corpus from the web to be used for learning of Named Entity Recognition systems. We use an NE list and an web search engine to collect web documents which contain the NE instances. The documents are refined through sentence separation and text refinement procedures and NE instances are finally tagged with the appropriate NE categories. Our experiments demonstrates that the suggested method can acquire enough NE tagged corpus equally useful to the manually tagged one without any human intervention.
Joohui An, Seungwoo Lee, Gary Geunbae Lee
Added 31 Oct 2010
Updated 31 Oct 2010
Type Conference
Year 2003
Where ACL
Authors Joohui An, Seungwoo Lee, Gary Geunbae Lee
Comments (0)