Sciweavers

» Robust web content extraction
LPNMR
2001
Springer
13 years 11 months ago
Declarative Information Extraction, Web Crawling, and Recursive Wrapping with Lixto
Lixto is a system and method for the visual and interactive generation of wrappers for Web pages under the supervision of a human developer, for automatically extracting informatio...
Robert Baumgartner, Sergio Flesca, Georg Gottlob
IEEEMSP
2002
IEEE
13 years 11 months ago
A robust and secure media signature scheme for JPEG images
In [1, 2, 3], we have introduced a robust and secure digital signature solution for multimedia content authentication, by integrating content feature extraction, error correctio...
Qibin Sun, Qi Tian, Shih-Fu Chang
ICNC
2005
Springer
14 years 7 days ago
Using SOFM to Improve Web Site Text Content
We introduce a new method to improve web site text content by identifying the most relevant free text in the web pages. In order to understand the variations in web page text, we c...
Sebastián A. Ríos, Juan D. Velá...
ICDE
2004
IEEE
14 years 8 months ago
Probe, Cluster, and Discover: Focused Extraction of QA-Pagelets from the Deep Web
In this paper, we introduce the concept of a QA-Pagelet to refer to the content region in a dynamic page that contains query matches. We present THOR, a scalable and efficient min...
James Caverlee, Ling Liu, David Buttler
WWW
2007
ACM
14 years 7 months ago
Semantic personalization of web portal contents
Enriching Web applications with personalized data is of major interest for facilitating the user access to the published contents, and therefore, for guaranteeing successful user ...
Christina Tziviskou, Marco Brambilla