Sciweavers

502 search results - page 83 / 101
» Extracting Partial Structures from HTML Documents
JUCS
2008
210 views
13 years 9 months ago
Systematic Characterisation of Objects in Digital Preservation: The eXtensible Characterisation Languages
During the last decades, digital objects have become the primary medium to create, shape, and exchange information. However, in contrast to analog objects such as books that dire...
Christoph Becker, Andreas Rauber, Volker Heydegger...
HICSS
1998
IEEE
112 views, Biometrics
14 years 2 months ago
AESOP: An Outline-Oriented Authoring System
Because a hypermedia document is more complex than conventional text, it requires preparation with respect to two key aspects. First, the author begins to develop a "vision"...
Takeshi Shimizu, Stephen W. Smoliar, John S. Borec...
IJDAR
2006
103 views
13 years 9 months ago
Table-processing paradigms: a research survey
Tables are a ubiquitous form of communication. While everyone seems to know what a table is, a precise, analytical definition of "tabularity" remains elusive because some...
David W. Embley, Matthew Hurst, Daniel P. Lopresti...
ECWEB
2005
Springer
127 views, ECommerce
14 years 3 months ago
Knowledge Discovery in Web-Directories: Finding Term-Relations to Build a Business Ontology
The Web continues to grow at a tremendous rate. Search engines find it increasingly difficult to provide useful results. To manage this explosively large number of Web documents,...
Sandip Debnath, Tracy Mullen, Arun Upneja, C. Lee ...
DEXA
2007
Springer
154 views, Database
14 years 3 months ago
Beyond Lazy XML Parsing
XML has become the standard format for data representation and exchange in domains ranging from Web to desktop applications. However, wide adoption of XML is hindered by inefficien...
Fernando Farfán, Vagelis Hristidis, Raju Ra...