Sciweavers

502 search results - page 17 / 101
» Extracting Partial Structures from HTML Documents
Sort
View
CHI
1996
ACM
14 years 18 days ago
Silk from a Sow's Ear: Extracting Usable Structures from the Web
In its current implementation, the World-Wide Web lacks much of the explicit structure and strong typing found in many closed hypertext systems. While this property has directly f...
Peter Pirolli, James E. Pitkow, Ramana Rao
DKE
1998
146views more  DKE 1998»
13 years 8 months ago
A Case study of Automatic Authoring: From a Textbook to a Hyper-Textbook
This paper presents a case-study of automatic construction of a hypertext from a large full-text document. The document we used as input of the automatic authoring process is a we...
Fabio Crestani, Massimo Melucci
DAS
2010
Springer
13 years 6 months ago
Information extraction by finding repeated structure
Repetition of layout structure is prevalent in document images. In document design, such repetition conveys the underlying logical and functional structure of the data. For exampl...
Evgeniy Bart, Prateek Sarkar
DASFAA
2005
IEEE
153views Database» more  DASFAA 2005»
14 years 2 months ago
FASST Mining: Discovering Frequently Changing Semantic Structure from Versions of Unordered XML Documents
Abstract. In this paper, we present a FASST mining approach to extract the frequently changing semantic structures (FASSTs), which are a subset of semantic substructures that chang...
Qiankun Zhao, Sourav S. Bhowmick
WWW
2008
ACM
14 years 9 months ago
Extracting XML schema from multiple implicit xml documents based on inductive reasoning
We propose a method of classifying XML documents and extracting XML schema from XML by inductive inference based on constraint logic programming. The goal of this work is to type ...
Masaya Eki, Tadachika Ozono, Toramatsu Shintani