Sciweavers

288 search results - page 24 / 58
» Extracting compound terms from domain corpora
WWW
2009
ACM
14 years 8 months ago
Extracting article text from the web with maximum subsequence segmentation
Much of the information on the Web is found in articles from online news outlets, magazines, encyclopedias, review collections, and other sources. However, extracting this content...
Jeff Pasternack, Dan Roth
LREC
2010
13 years 9 months ago
mwetoolkit: a Framework for Multiword Expression Identification
This paper presents the Multiword Expression Toolkit (mwetoolkit), an environment for type and language-independent MWE identification from corpora. The mwetoolkit provides a targ...
Carlos Ramisch, Aline Villavicencio, Christian Boi...
LREC
2008
13 years 9 months ago
Linguistically Light Lexical Extensions for Ontologies
An increasing number of enterprises are beginning to include semantic web ontologies into their Information Extraction (IE) and Text Analytics (TA) applications. This can be chall...
Brian Davis, Siegfried Handschuh, Alexander Trouss...
PAKM
2004
13 years 9 months ago
Automatic Generation of Taxonomies from the WWW
In this paper we present a methodology to extract information from the Web to build a taxonomy of terms and Web resources for a given domain. This taxonomy represents a hierarchy o...
David Sánchez, Antonio Moreno
DSS
2007
13 years 7 months ago
An associate constraint network approach to extract multi-lingual information for crime analysis
International crime and terrorism have drawn increasing attention in recent years. Retrieving relevant information from criminal records and suspect communications is important in...
Christopher C. Yang, Kar Wing Li