Sciweavers

ACL
2012

Named Entity Disambiguation in Streaming Data

12 years 2 months ago
Named Entity Disambiguation in Streaming Data
The named entity disambiguation task is to resolve the many-to-many correspondence between ambiguous names and the unique realworld entity. This task can be modeled as a classification problem, provided that positive and negative examples are available for learning binary classifiers. High-quality senseannotated data, however, are hard to be obtained in streaming environments, since the training corpus would have to be constantly updated in order to accomodate the fresh data coming on the stream. On the other hand, few positive examples plus large amounts of unlabeled data may be easily acquired. Producing binary classifiers directly from this data, however, leads to poor disambiguation performance. Thus, we propose to enhance the quality of the classifiers using finer-grained variations of the well-known ExpectationMaximization (EM) algorithm. We conducted a systematic evaluation using Twitter streaming data and the results show that our classifiers are extremely effective, pro...
Alexandre Davis, Adriano Veloso, Altigran Soares d
Added 29 Sep 2012
Updated 29 Sep 2012
Type Journal
Year 2012
Where ACL
Authors Alexandre Davis, Adriano Veloso, Altigran Soares da Silva, Alberto H. F. Laender, Wagner Meira Jr.
Comments (0)