Search Sciweavers | Sciweavers

708 search results - page 17 / 142

» Identifying Content Blocks from Web Documents

125

click to vote

SIGDOC
2004
ACM

133views Document Analysis» more SIGDOC 2004»

Semantic thumbnails: a novel method for summarizing document collections

15 years 8 months ago

Download www.wright.edu

The concept of thumbnails is common in image representation. A thumbnail is a highly compressed version of an image that provides a small, yet complete visual representation to th...

Arijit Sengupta, Mehmet M. Dalkilic, James C. Cost...

claim paper

Read More »

134

click to vote

CDVE
2006
Springer

130views Visualization» more CDVE 2006»

Flexible Collaboration over XML Documents

15 years 6 months ago

Download www.loria.fr

Abstract. XML documents are increasingly being used to mark up various kinds of data from web content to scientific data. Often these documents need to be collaboratively created a...

Claudia-Lavinia Ignat, Moira C. Norrie

claim paper

Read More »

107

click to vote

CIKM
2003
Springer

129views Information Technology» more CIKM 2003»

Extracting unstructured data from template generated web documents

15 years 8 months ago

Download www.ir.iit.edu

We propose a novel approach that identifies web page templates and extracts the unstructured data. Extracting only the body of the page and eliminating the template increases the ...

Ling Ma, Nazli Goharian, Abdur Chowdhury, Misun Ch...

claim paper

Read More »

135

click to vote

AUSAI
2003
Springer

153views Artificial Intelligence» more AUSAI 2003»

Semi-Automatic Construction of Metadata from a Series of Web Documents

15 years 8 months ago

Download qir.kyushu-u.ac.jp

Metadata plays an important role in discovering, collecting, extracting and aggregating Web data. This paper proposes a method of constructing metadata for a speciﬁc topic. The m...

Sachio Hirokawa, Eisuke Itoh, Tetsuhiro Miyahara

claim paper

Read More »

134

click to vote

AUSDM
2006
Springer

160views Data Mining» more AUSDM 2006»

Extraction of Flat and Nested Data Records from Web Pages

15 years 6 months ago

Download crpit.com

This paper deals with studies the problem of identification and extraction of flat and nested data records from a given web page. With the explosive growth of information sources ...

Siddu P. Algur, P. S. Hiremath

claim paper

Read More »

« Prev « First page 17 / 142 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers