Sciweavers

189 search results - page 14 / 38
» Proper Name Extraction from Non-Journalistic Texts
Sort
View
LREC
2010
175views Education» more  LREC 2010»
13 years 8 months ago
Adapting a resource-light highly multilingual Named Entity Recognition system to Arabic
We present a working Arabic information extraction (IE) system that is used to analyze large volumes of news texts every day to extract the named entity (NE) types person, organiz...
Wajdi Zaghouani, Bruno Pouliquen, Mohamed Ebrahim,...
EMNLP
2007
13 years 8 months ago
Learning to Find English to Chinese Transliterations on the Web
We present a method for learning to find English to Chinese transliterations on the Web. In our approach, proper nouns are expanded into new queries aimed at maximizing the probab...
Jian-Cheng Wu, Jason S. Chang
SEMWEB
2009
Springer
14 years 1 months ago
Populating the Semantic Web by Macro-reading Internet Text
A key question regarding the future of the semantic web is “how will we acquire structured information to populate the semantic web on a vast scale?” One approach is to enter t...
Tom M. Mitchell, Justin Betteridge, Andrew Carlson...
WWW
2007
ACM
14 years 8 months ago
Organizing and searching the world wide web of facts -- step two: harnessing the wisdom of the crowds
As part of a large effort to acquire large repositories of facts from unstructured text on the Web, a seed-based framework for textual information extraction allows for weakly sup...
Marius Pasca
WEBI
2005
Springer
14 years 25 days ago
A Semi-Supervised Document Clustering Algorithm Based on EM
Document clustering is a very hard task in Automatic Text Processing since it requires to extract regular patterns from a document collection without a priori knowledge on the cat...
Leonardo Rigutini, Marco Maggini