Search Sciweavers | Sciweavers

684 search results - page 37 / 137

» Extracting semantic structure of web documents using content...

124

click to vote

DEXA
2009
Springer

173views Database» more DEXA 2009»

Incremental Ontology-Based Extraction and Alignment in Semi-structured Documents

15 years 11 months ago

Download wwwdi.supelec.fr

SHIRI 1 is an ontology-based system for integration of semistructured documents related to a speciﬁc domain. The system’s purpose is to allow users to access to relevant parts ...

Mouhamadou Thiam, Nacéra Bennacer, Nathalie...

claim paper

Read More »

116

click to vote

KES
2006
Springer

137views Information Technology» more KES 2006»

Web Site Off-Line Structure Reconfiguration: A Web User Browsing Analysis

15 years 4 months ago

Download wi.dii.uchile.cl

The correct web site text content must be help to the visitors to find what they are looking for. However, the reality is quite different, many times the web page text content is a...

Sebastián A. Ríos, Juan D. Vel&aacut...

claim paper

Read More »

151

click to vote

PODS
2002
ACM

117views Database» more PODS 2002»

Monadic Datalog and the Expressive Power of Languages for Web Information Extraction

16 years 4 months ago

Download www.cs.cornell.edu

Research on information extraction from Web pages (wrapping) has seen much activity in recent times (particularly systems implementations), but little work has been done on formal...

Georg Gottlob, Christoph Koch

claim paper

Read More »

241

click to vote

VLDB
2003
ACM

125views Database» more VLDB 2003»

THESUS: Organizing Web document collections based on link semantics

16 years 4 months ago

Download www.db-net.aueb.gr

Abstract. The requirements for effective search and management of the WWW are stronger than ever. Currently Web documents are classified based on their content not taking into acco...

Maria Halkidi, Benjamin Nguyen, Iraklis Varlamis, ...

claim paper

Read More »

136

click to vote

ECIR
2008
Springer

185views Information Technology» more ECIR 2008»

Clustering Template Based Web Documents

15 years 5 months ago

Download www.informatik.uni-mainz.de

More and more documents on the World Wide Web are based on templates. On a technical level this causes those documents to have a quite similar source code and DOM tree structure. G...

Thomas Gottron

claim paper

Read More »

« Prev « First page 37 / 137 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers