Sciweavers

» Extracting semantic structure of web documents using content...
DAS
2004
Springer
14 years 27 days ago
An Integrated Approach for Automatic Semantic Structure Extraction in Document Images
In this paper we present an integrated approach for semantic structure extraction in document images. Document images are initially processed to extract both their layout and logic...
Margherita Berardi, Michele Lapi, Donato Malerba
SAMT
2007
Springer
14 years 1 month ago
Document Layout Substructure Discovery
In this paper we present a system, DoLSuD, for the automatic discovery of relevant substructures in a document layout. DoLSuD, Document Layout Substructure Discovery, ext...
Claudio Andreatta
CIKM
2009
ACM
14 years 2 months ago
Annotating Wikipedia articles with semantic tags for structured retrieval
Structured retrieval aims at exploiting the structural information of documents when searching for documents. Structured retrieval makes use of both content and structure of docum...
Saravadee Sae Tan, Tang Enya Kong, Gian Chand Sodh...
WWW
2008
ACM
14 years 8 months ago
Web page sectioning using regex-based template
This work aims to provide a novel, site-specific web page segmentation and section importance detection algorithm, which leverages structural, content, and visual information. The...
Rupesh R. Mehta, Amit Madaan
ICANN
2005
Springer
14 years 1 month ago
Content-Based Retrieval of Web Pages and Other Hierarchical Objects with Self-organizing Maps
We propose a content-based information retrieval (CBIR) method that models known relationships between multimedia objects as a hierarchical tree-structure incorporating additional ...
Mats Sjöberg, Jorma Laaksonen