Search Sciweavers | Sciweavers

139 search results - page 6 / 28

» Semi-Automatic Wrapper Generation for Internet Information S...

154

click to vote

WEBI
2005
Springer

94views Internet Technology» more WEBI 2005»

ITPilot: A Toolkit for Industrial-Strength Web Data Extraction

16 years 16 days ago

Download www.tic.udc.es

In recent years, many research systems have been proposed to perform data extraction and automation tasks on Web sources. Since most of today’s Web sources are “human-readable...

Alberto Pan, Juan Raposo, Manuel Álvarez, P...

claim paper

Read More »

184

click to vote

ER
1999
Springer

155views Database» more ER 1999»

XML-based Components for Federating Multiple Heterogeneous Data Sources

15 years 11 months ago

Download dntt.free.fr

Several federated database systems have been built in the past using the relational or the object model as federating model. This paper gives an overview of the XMLMedia system, a ...

Georges Gardarin, Fei Sha, Tuyet-Tram Dang-Ngoc

claim paper

Read More »

205

click to vote

AAAI
2007

135views Intelligent Agents» more AAAI 2007»

Template-Independent News Extraction Based on Visual Consistency

15 years 9 months ago

Download www.cse.psu.edu

Wrapper is a traditional method to extract useful information from Web pages. Most previous works rely on the similarity between HTML tag trees and induced template-dependent wrap...

Shuyi Zheng, Ruihua Song, Ji-Rong Wen

claim paper

Read More »

225

click to vote

SIGMOD
1997
ACM

127views Database» more SIGMOD 1997»

Infomaster: An Information Integration System

15 years 11 months ago

Download infolab.stanford.edu

Infomaster is an information integration system that provides integrated access tomultiple distributed heterogeneous information sources on the Internet, thus giving the illusion ...

Michael R. Genesereth, Arthur M. Keller, Oliver M....

claim paper

Read More »

170

click to vote

ICWE
2009
Springer

151views Internet Technology» more ICWE 2009»

A Layout-Independent Web News Article Contents Extraction Method Based on Relevance Analysis

16 years 1 months ago

Download tokuda-www.cs.titech.ac.jp

Abstract. The traditional Web news article contents extraction methods are time-costly and need much maintenance because they analyze the layout of news pages to generate the wrapp...

Hao Han, Takehiro Tokuda

claim paper

Read More »

« Prev « First page 6 / 28 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers