Search Sciweavers | Sciweavers

232 search results - page 13 / 47

» Query-related data extraction of hidden web documents

120

click to vote

KDD
2002
ACM

148views Data Mining» more KDD 2002»

Discovering informative content blocks from Web documents

16 years 4 months ago

Download www.cs.ualberta.ca

In this paper, we propose a new approach to discover informative contents from a set of tabular documents (or Web pages) of a Web site. Our system, InfoDiscoverer, first partition...

Shian-Hua Lin, Jan-Ming Ho

claim paper

Read More »

127

click to vote

CIKM
2005
Springer

166views Information Technology» more CIKM 2005»

Concept-based interactive query expansion

15 years 9 months ago

Download homepages.dcc.ufmg.br

Despite the recent advances in search quality, the fast increase in the size of the Web collection has introduced new challenges for Web ranking algorithms. In fact, there are sti...

Bruno M. Fonseca, Paulo Braz Golgher, Bruno P&ocir...

claim paper

Read More »

122

Voted

SAINT
2005
IEEE

120views Internet Technology» more SAINT 2005»

Learning Logic Wrappers for Information Extraction from the Web

15 years 9 months ago

Download software.ucv.ro

This paper discusses a methodology for applying general-purpose ﬁrst-order inductive learning to extract information from Web documents structured as unranked ordered trees. The...

Costin Badica, Elvira Popescu, Amelia Badica

claim paper

Read More »

156

Voted

DEXA
2005
Springer

109views Database» more DEXA 2005»

An XML Approach to Semantically Extract Data from HTML Tables

15 years 9 months ago

Download www.cis.unisa.edu.au

Abstract. Data intensive information is often published on the internet in the format of HTML tables. Extracting some of the information that is of users’ interest from the inter...

Jixue Liu, Zhuoyun Ao, Ho-Hyun Park, Yongfeng Chen

claim paper

Read More »

127

Voted

PVLDB
2010

135views more PVLDB 2010»

SXPath - Extending XPath towards Spatial Querying on Web Documents

15 years 2 months ago

Download www.vldb.org

Querying data from presentation formats like HTML, for purposes such as information extraction, requires the consideration of tree structures as well as the consideration of spati...

Ermelinda Oro, Massimo Ruffolo, Steffen Staab

claim paper

Read More »

« Prev « First page 13 / 47 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers