Sciweavers

330 search results - page 22 / 66
» Document structure analysis algorithms: a literature survey
Sort
View
CIKM
2008
Springer
13 years 9 months ago
Using structured text for large-scale attribute extraction
We propose a weakly-supervised approach for extracting class attributes from structured text available within Web documents. The overall precision of the extracted attributes is a...
Sujith Ravi, Marius Pasca
DOCENG
2010
ACM
13 years 5 months ago
Diffing, patching and merging XML documents: toward a generic calculus of editing deltas
This work addresses what we believe to be a central issue in the field of XML diff and merge computation: the mathematical modeling o-called editing deltas and the study of their ...
Jean-Yves Vion-Dury
ICDAR
2009
IEEE
14 years 2 months ago
Hybrid Page Layout Analysis via Tab-Stop Detection
A new hybrid page layout analysis algorithm is proposed, which uses bottom-up methods to form an initial data-type hypothesis and locate the tab-stops that were used when the page...
Raymond W. Smith
GROUP
2005
ACM
14 years 1 months ago
Proactive support for the organization of shared workspaces using activity patterns and content analysis
Shared workspace systems provide virtual places for selforganized and semi-structured cooperation between local and distributed team members. These cooperation systems have been a...
Wolfgang Prinz, Baber Zaman
AIIA
2007
Springer
14 years 1 months ago
Harvesting Relational and Structured Knowledge for Ontology Building in the WPro Architecture
We present two algorithms for supporting semi-automatic ontology building, integrated in WPro, a new architecture for ontology learning from Web documents. The first algorithm auto...
Daniele Bagni, Marco Cappella, Maria Teresa Pazien...