Sciweavers

» Automatic Wrappers for Large Scale Web Extraction
ICDE
2007
IEEE
14 years 9 months ago
Collaborative Wrapping: A Turbo Framework for Web Data Extraction
To access data sources on the Web, a crucial step is wrapping, which translates query responses, rendered in textual HTML, back into their relational form. Traditionally, this pro...
Shui-Lung Chuang, Kevin Chen-Chuan Chang, ChengXia...
CLOUDCOM
2010
Springer
13 years 5 months ago
Efficient Metadata Generation to Enable Interactive Data Discovery over Large-Scale Scientific Data Collections
Discovering the correct dataset efficiently is critical for computations and effective simulations in scientific experiments. In contrast to searching web documents over the Intern...
Sangmi Lee Pallickara, Shrideep Pallickara, Milija...
KDD
2007
ACM
14 years 8 months ago
Joint optimization of wrapper generation and template detection
Many websites have large collections of pages generated dynamically from an underlying structured source like a database. The data of a category are typically encoded into similar...
Shuyi Zheng, Ruihua Song, Ji-Rong Wen, Di Wu
SIGMOD
2006
ACM
14 years 7 months ago
Documentum ECI self-repairing wrappers: performance analysis
Documentum Enterprise Content Integration (ECI) services is a content integration middleware that provides one-query access to the Intranet and Internet content resources. The ECI...
Boris Chidlovskii, Bruno Roustant, Marc Brette
WEBI
2005
Springer
14 years 1 month ago
ITPilot: A Toolkit for Industrial-Strength Web Data Extraction
In recent years, many research systems have been proposed to perform data extraction and automation tasks on Web sources. Since most of today’s Web sources are “human-readable...
Alberto Pan, Juan Raposo, Manuel Álvarez, P...