Nested Named Entity Recognition in Historical Archive Text

16 years 1 months ago

Download homepages.inf.ed.ac.uk

This paper describes work on Named Entity Recognition (NER), in preparation for Relation Extraction (RE), on data from a historical archive organisation. As is often the case in the cultural heritage domain, the source text includes a high percentage of specialist terminology, and is of very variable quality in terms of grammaticality and completeness. The NER and RE tasks were carried out using a specially annotated corpus, and are themselves preliminary steps in a larger project whose aim is to transform discovered relations into a graph structure that can be queried using standard tools. Experimental results from the NER task are described, with emphasis on dealing with nested entities using a multi-word token method. The overall objective is to improve access by non-specialist users to a valuable cultural resource.

Kate Byrne

Real-time Traffic

Historical Archive Organisation | Named Entity Recognition | Paper Describes Work | Semantic Computing | SEMCO 2007 |

claim paper

Added	04 Jun 2010
Updated	04 Jun 2010
Type	Conference
Year	2007
Where	SEMCO
Authors	Kate Byrne

Sciweavers

Nested Named Entity Recognition in Historical Archive Text

Historical Archive Organisation | Named Entity Recognition | Paper Describes Work | Semantic Computing | SEMCO 2007 |

Explore & Download

Productivity Tools

Sciweavers