Sciweavers

502 search results - page 13 / 101
» Extracting Partial Structures from HTML Documents
Sort
View
DOCENG
2009
ACM
15 years 8 months ago
From rhetorical structures to document structure: shallow pragmatic analysis for document engineering
In this paper, we extend previous work on the automatic structuring of medical documents using content analysis. Our long-term objective is to take advantage of specific rhetoric ...
Gersende Georg, Hugo Hernault, Marc Cavazza, Helmu...
100
Voted
WWW
2006
ACM
16 years 2 months ago
Logical structure based semantic relationship extraction from semi-structured documents
Addressed in this paper is the issue of semantic relationship extraction from semi-structured documents. Many research efforts have been made so far on the semantic information ex...
Kuo Zhang, Gang Wu, Juan-Zi Li
CIKM
1998
Springer
15 years 6 months ago
Ontology-Based Extraction and Structuring of Information from Data-Rich Unstructured Documents
We present a new approach to extracting information from unstructured documents based on an application ontology that describes a domain of interest. Starting with such an ontolog...
David W. Embley, Douglas M. Campbell, Randy D. Smi...
IEICET
2006
116views more  IEICET 2006»
15 years 2 months ago
Extraction of Semantic Text Portion Related to Anchor Link
Recently, semantic text portion (STP) is getting popular in the field of Web mining. STP is a text portion in the original page which is semantically related to the anchor pointing...
Bui Quang Hung, Masanori Otsubo, Yoshinori Hijikat...
ECML
2001
Springer
15 years 6 months ago
Wrapping Web Information Providers by Transducer Induction
Modern agent and mediator systems communicate to a multitude of Web information providers to better satisfy user requests. They use wrappers to extract relevant information from HT...
Boris Chidlovskii