Sciweavers

» Applying Pattern Mining to Web Information Extraction
EACL
2003
ACL Anthology
13 years 9 months ago
Mining Web Sites Using Unsupervised Adaptive Information Extraction
Alexiei Dingli, Fabio Ciravegna, David Guthrie, Yo...
SIGMOD
2010
ACM
13 years 7 months ago
I4E: interactive investigation of iterative information extraction
Information extraction systems are increasingly being used to mine structured information from unstructured text documents. A commonly used unsupervised technique is to build iter...
Anish Das Sarma, Alpa Jain, Divesh Srivastava
IICAI
2003
13 years 9 months ago
Web Usage Mining: Extraction, Maintenance and Behaviour Trends
With the growing popularity of the web, large volumes of data are gathered automatically by Web Servers and collected into access log files. Analysis of such files is generally cal...
Pierre-Alain Laur, Maguelonne Teisseire, Pascal Po...
KDD
2007
ACM
14 years 8 months ago
Corroborate and learn facts from the web
The web contains lots of interesting factual information about entities, such as celebrities, movies or products. This paper describes a robust bootstrapping approach to corrobora...
Shubin Zhao, Jonathan Betz
WSDM
2010
ACM
14 years 2 months ago
Learning URL patterns for webpage de-duplication
Presence of duplicate documents in the World Wide Web adversely affects crawling, indexing and relevance, which are the core building blocks of web search. In this paper, we pres...
Hema Swetha Koppula, Krishna P. Leela, Amit Agarwa...