Sciweavers

95 search results - page 5 / 19
» Classifying Web Pages with Visual Features
Sort
View
LPNMR
2001
Springer
14 years 1 days ago
Declarative Information Extraction, Web Crawling, and Recursive Wrapping with Lixto
Lixto is a system and method for the visual and interactive generation of wrappers for Web pages under the supervision of a human developer, for automatically extracting informatio...
Robert Baumgartner, Sergio Flesca, Georg Gottlob
ICDAR
2003
IEEE
14 years 27 days ago
Document page similarity based on layout visual saliency: Application to query by example and document classification
In this paper we propose to define a measure of visual similarity to compare different pages in a corpus. This measure is based on the analysis of the visual layout saliency of th...
Véronique Eglin, Stéphane Bres
LREC
2008
133views Education» more  LREC 2008»
13 years 9 months ago
Automatic Identification of Temporal Information in Tourism Web Pages
This paper presents our work on the detection of temporal information in web pages. The pages examined within the scope of this study were taken from the tourism sector and the te...
Stéphanie Weiser, Philippe Laublet, Jean-Lu...
WEBDB
1999
Springer
196views Database» more  WEBDB 1999»
13 years 12 months ago
Web Ecology: Recycling HTML Pages as XML Documents Using W4F
In this paper we present the World-Wide Web Wrapper Factory (W4F), a Java toolkit to generate wrappers for Web data sources. Some key features of W4F are an expressive language to...
Arnaud Sahuguet, Fabien Azavant
ICDM
2007
IEEE
149views Data Mining» more  ICDM 2007»
14 years 1 months ago
Extracting Author Meta-Data from Web Using Visual Features
Enriching digital library’s author meta-data can lead to valuable services and applications. This paper addresses the problem of extracting authors’ information from their hom...
Shuyi Zheng, Ding Zhou, Jia Li, C. Lee Giles