Sciweavers

31 search results - page 3 / 7
» A Large-Scale Web Data Collection as a Natural Language Proc...
Sort
View
ICDE
2008
IEEE
218views Database» more  ICDE 2008»
14 years 8 months ago
AxPRE Summaries: Exploring the (Semi-)Structure of XML Web Collections
The nature of semistructured data in web collections is evolving. Increasingly, XML web documents (or documents exchanged via web services) are valid with regard to a schema, yet ...
Mariano P. Consens, Flavio Rizzolo, Alejandro A. V...
IJCNLP
2005
Springer
14 years 14 days ago
Aligning Needles in a Haystack: Paraphrase Acquisition Across the Web
This paper presents a lightweight method for unsupervised extraction of paraphrases from arbitrary textual Web documents. The method differs from previous approaches to paraphrase...
Marius Pasca, Péter Dienes
WWW
2011
ACM
13 years 1 months ago
Web scale NLP: a case study on url word breaking
This paper uses the URL word breaking task as an example to elaborate what we identify as crucialin designingstatistical natural language processing (NLP) algorithmsfor Web scale ...
Kuansan Wang, Christopher Thrasher, Bo-June Paul H...
NLDB
2005
Springer
14 years 14 days ago
Web Directory Construction Using Lexical Chains
Web Directories provide a way of locating relevant information on the Web. Typically, Web Directories rely on humans putting in significant time and effort into finding important p...
Sofia Stamou, Vlassis Krikos, Pavlos Kokosis, Alex...
LREC
2010
237views Education» more  LREC 2010»
13 years 8 months ago
Entity Mention Detection using a Combination of Redundancy-Driven Classifiers
We present an experimental framework for Entity Mention Detection in which two different classifiers are combined to exploit Data Redundancy attained through the annotation of a l...
Silvana Marianela Bernaola Biggio, Manuela Speranz...