Web-scale named entity recognition

15 years 8 months ago

Download www.cse.iitb.ac.in

Automatic recognition of named entities such as people, places, organizations, books, and movies across the entire web presents a number of challenges, both of scale and scope. Data for training general named entity recognizers is difficult to come by, and efficient machine learning methods are required once we have found hundreds of millions of labeled observations. We present an implemented system that addresses these issues, including a method for automatically generating training data, and a multi-class online classification training method that learns to recognize not only high level categories such as place and person, but also more finegrained categories such as soccer players, birds, and universities. The resulting system gives precision and recall performance comparable to that obtained for more limited entity types in much more structured domains such as company recognition in newswire, even though web documents often lack consistent capitalization and grammatical sentence c...

Casey Whitelaw, Alexander Kehlenbeck, Nemanja Petr

Real-time Traffic

CIKM 2008 | General Named Entity | Information Management | Limited Entity Types | Named Entity Recognition |

claim paper

» An Approach to WebScale NamedEntity Disambiguation

» TwiNER named entity recognition in targeted twitter stream

» One Class per Named Entity Exploiting Unlabeled Text for Named Entity Recognition

» Named Entity Chunking Techniques in Supervised Learning for Japanese Named Entity Recognit...

» Entity Name System The BackBone of an Open and Scalable Web of Data

» Using Corpusderived Name Lists for Named Entity Recognition

» What makes a gene name Named entity recognition in the biomedical literature

» Bootstrapping Named Entity Recognition with Automatically Generated Gazetteer Lists

» Efficient combined approach for named entity recognition in spoken language

Post Info
More Details (n/a)

Added	12 Oct 2010
Updated	12 Oct 2010
Type	Conference
Year	2008
Where	CIKM
Authors	Casey Whitelaw, Alexander Kehlenbeck, Nemanja Petrovic, Lyle H. Ungar

Comments (0)

Sciweavers

Web-scale named entity recognition

CIKM 2008 | General Named Entity | Information Management | Limited Entity Types | Named Entity Recognition |

Explore & Download

Productivity Tools

Sciweavers