Sciweavers

263 search results - page 13 / 53
» Re-engineering structures from Web documents
Sort
View
IICS
2004
Springer
14 years 26 days ago
Towards Logical Hypertext Structure
Facing the retrieval problem according to the overwhelming set of documents online the adaptation of text categorization to web units has recently been pushed. The aim is to utiliz...
Alexander Mehler, Matthias Dehmer, Rüdiger Gl...
WEBDB
1999
Springer
196views Database» more  WEBDB 1999»
13 years 11 months ago
Web Ecology: Recycling HTML Pages as XML Documents Using W4F
In this paper we present the World-Wide Web Wrapper Factory (W4F), a Java toolkit to generate wrappers for Web data sources. Some key features of W4F are an expressive language to...
Arnaud Sahuguet, Fabien Azavant
BMCBI
2007
176views more  BMCBI 2007»
13 years 7 months ago
The Firegoose: two-way integration of diverse data from different bioinformatics web resources with desktop applications
Background: Information resources on the World Wide Web play an indispensable role in modern biology. But integrating data from multiple sources is often encumbered by the need to...
J. Christopher Bare, Paul T. Shannon, Amy K. Schmi...
PVLDB
2008
141views more  PVLDB 2008»
13 years 7 months ago
WebTables: exploring the power of tables on the web
The World-Wide Web consists of a huge number of unstructured documents, but it also contains structured data in the form of HTML tables. We extracted 14.1 billion HTML tables from...
Michael J. Cafarella, Alon Y. Halevy, Daisy Zhe Wa...
SAC
2000
ACM
13 years 12 months ago
A Synchronization Model for Hypermedia Documents Navigation
This paper presents a model for describing the synchronization between several media delivered over a network in a Web-based environment. Synchronization concerns the download and...
Augusto Celentano, Ombretta Gaggi