Search Sciweavers | Sciweavers

103 search results - page 4 / 21

» Visual Web Information Extraction with Lixto

222

click to vote

WWW
2007
ACM

144views Internet Technology» more WWW 2007»

Towards domain-independent information extraction from web tables

16 years 7 months ago

Download www2007.org

Traditionally, information extraction from web tables has focused on small, more or less homogeneous corpora, often based on assumptions about the use of <table> tags. A mul...

Bernhard Krüpl, Bernhard Pollak, Marcus Herzo...

claim paper

Read More »

195

click to vote

AAAI
2006

123views Intelligent Agents» more AAAI 2006»

Table Extraction Using Spatial Reasoning on the CSS2 Visual Box Model

15 years 8 months ago

Download www.aaai.org

Tables on web pages contain a huge amount of semantically explicit information, which makes them a worthwhile target for automatic information extraction and knowledge acquisition...

Wolfgang Gatterbauer, Paul Bohunsky

claim paper

Read More »

197

click to vote

APWEB
2003
Springer

148views Internet Technology» more APWEB 2003»

Extracting Content Structure for Web Pages Based on Visual Representation

16 years 7 days ago

Download www.dbs.ifi.lmu.de

Abstract. A new web content structure based on visual representation is proposed in this paper. Many web applications such as information retrieval, information extraction and auto...

Deng Cai, Shipeng Yu, Ji-Rong Wen, Wei-Ying Ma

claim paper

Read More »

201

click to vote

ICDM
2007
IEEE

149views Data Mining» more ICDM 2007»

Extracting Author Meta-Data from Web Using Visual Features

16 years 1 months ago

Download www.cse.psu.edu

Enriching digital library’s author meta-data can lead to valuable services and applications. This paper addresses the problem of extracting authors’ information from their hom...

Shuyi Zheng, Ding Zhou, Jia Li, C. Lee Giles

claim paper

Read More »

189

click to vote

AUSDM
2006
Springer

160views Data Mining» more AUSDM 2006»

Extraction of Flat and Nested Data Records from Web Pages

15 years 10 months ago

Download crpit.com

This paper deals with studies the problem of identification and extraction of flat and nested data records from a given web page. With the explosive growth of information sources ...

Siddu P. Algur, P. S. Hiremath

claim paper

Read More »

« Prev « First page 4 / 21 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers