Sciweavers

Search results for: On the Automatic Extraction of Data from the Hidden Web
ICDE
2006
IEEE
14 years 10 months ago
Extracting Objects from the Web
Extracting and integrating object information from the Web is of great significance for Web data management. The existing Web information extraction techniques cannot provide sati...
Zaiqing Nie, Fei Wu, Ji-Rong Wen, Wei-Ying Ma
NAACL
2003
13 years 10 months ago
A Web-Trained Extraction Summarization System
A serious bottleneck in the development of trainable text summarization systems is the shortage of training data. Constructing such data is a very tedious task, especially because...
Liang Zhou, Eduard H. Hovy
WSDM
2012
ACM
12 years 4 months ago
WebSets: extracting sets of entities from the web using unsupervised information extraction
We describe an open-domain information extraction method for extracting concept-instance pairs from an HTML corpus. Most earlier approaches to this problem rely on combining cluste...
Bhavana Bharat Dalvi, William W. Cohen, Jamie Call...
EMNLP
2007
13 years 10 months ago
Bootstrapping Information Extraction from Field Books
We present two machine learning approaches to information extraction from semi-structured documents that can be used if no annotated training data are available, but there does ex...
Sander Canisius, Caroline Sporleder
WWW
2010
ACM
14 years 3 months ago
Entity relation discovery from web tables and links
The World-Wide Web consists not only of a huge number of unstructured texts, but also a vast amount of valuable structured data. Web tables [2] are a typical type of structured in...
Cindy Xide Lin, Bo Zhao, Tim Weninger, Jiawei Han,...