Sciweavers

910 search results - page 46 / 182
» Testbed for information extraction from deep web
Sort
View
CCIA
2005
Springer
14 years 1 months ago
Automatic discovery of synonyms and lexicalizations from the Web
The search of Web resources is a very important topic due to the huge amount of valuable information available in the WWW. Standard search engines can be a great help but they are ...
David Sánchez, Antonio Moreno
ADC
2006
Springer
130views Database» more  ADC 2006»
14 years 2 months ago
A two-phase rule generation and optimization approach for wrapper generation
Web information extraction is a fundamental issue for web information management and integrations. A common approach is to use wrappers to extract data from web pages or documents...
Yanan Hao, Yanchun Zhang
AAAI
2008
13 years 10 months ago
Automatic Extraction of Data Points and Text Blocks from 2-Dimensional Plots in Digital Documents
Two dimensional plots (2-D) in digital documents on the web are an important source of information that is largely under-utilized. In this paper, we outline how data and text can ...
Saurabh Kataria, William Browuer, Prasenjit Mitra,...
WWW
2004
ACM
14 years 8 months ago
Automatic extraction of web search interfaces for interface schema integration
This paper provides an overview of a technique for extracting information from the Web search interfaces of e-commerce search engines that is useful for supporting automatic searc...
Hai He, Weiyi Meng, Clement T. Yu, Zonghuan Wu
ICDM
2006
IEEE
164views Data Mining» more  ICDM 2006»
14 years 2 months ago
Unsupervised Learning of Tree Alignment Models for Information Extraction
We propose an algorithm for extracting fields from HTML search results. The output of the algorithm is a database table– a data structure that better lends itself to high-level...
Philip Zigoris, Damian Eads, Yi Zhang