Sciweavers

2677 search results - page 47 / 536
» Extracting Structured Data from Web Pages
Sort
View
SIGMOD
2009
ACM
140views Database» more  SIGMOD 2009»
14 years 3 months ago
Robust web extraction: an approach based on a probabilistic tree-edit model
On script-generated web sites, many documents share common HTML tree structure, allowing wrappers to effectively extract information of interest. Of course, the scripts and thus ...
Nilesh N. Dalvi, Philip Bohannon, Fei Sha
IUI
2006
ACM
14 years 2 months ago
Recovering semantic relations from web pages based on visual cues
Recovering semantic relations between different parts of web pages are of great importance for multi-platform web interface development, as they make it possible to re-distribute ...
Peifeng Xiang, Yuanchun Shi
WSDM
2012
ACM
252views Data Mining» more  WSDM 2012»
12 years 4 months ago
WebSets: extracting sets of entities from the web using unsupervised information extraction
We describe a open-domain information extraction method for extracting concept-instance pairs from an HTML corpus. Most earlier approaches to this problem rely on combining cluste...
Bhavana Bharat Dalvi, William W. Cohen, Jamie Call...
COMAD
2008
13 years 10 months ago
Kshitij: A Search and Page Recommendation System for Wikipedia
Semantic information helps in identifying the context of a document. It will be interesting to find out how effectively this information can be used in recommending related docume...
Phanikumar Bhamidipati, Kamalakar Karlapalem
WWW
2007
ACM
14 years 9 months ago
Extraction and classification of dense communities in the web
The World Wide Web (WWW) is rapidly becoming important for society as a medium for sharing data, information and services, and there is a growing interest in tools for understandi...
Yon Dourisboure, Filippo Geraci, Marco Pellegrini