Sciweavers

502 search results - page 14 / 101
» Extracting Partial Structures from HTML Documents
SIGMOD
2000
ACM
236 views, Database
14 years 24 days ago
XTRACT: A System for Extracting Document Type Descriptors from XML Documents
XML is rapidly emerging as the new standard for data representation and exchange on the Web. An XML document can be accompanied by a Document Type Descriptor (DTD) which plays the...
Minos N. Garofalakis, Aristides Gionis, Rajeev Ras...
WWW
2006
ACM
14 years 9 months ago
Relaxed: on the way towards true validation of compound documents
To maintain interoperability in the Web environment it is necessary to comply with Web standards. Current specifications of HTML and XHTML languages define conformance conditions ...
Jirka Kosek, Petr Nálevka
BIBE
2004
IEEE
156 views, Bioinformatics
14 years 5 days ago
GeneWebEx: Gene Annotation Web Extraction, Aggregation, and Updating from Web-Based Biomolecular Databanks
Numerous genomic annotations are currently stored in different web-accessible databanks that scientists need to mine with user-defined queries and in a batch mode to orderly integ...
Marco Masseroli, Andrea Stella, Natalia Meani, Myr...
ICALT
2006
IEEE
14 years 2 months ago
A Semi-Automatic Tool using Ontology to Extract Learning Objects
The approach presented in this paper is intended for the semi-automatic construction of a learning object repository from HTML pages. An extraction method consists of applying the...
Bich-Liên Doan, Yolaine Bourda, Vasile Dumit...
WWW
2004
ACM
14 years 9 months ago
OntoMiner: bootstrapping ontologies from overlapping domain specific web sites
In this paper, we present automated techniques for bootstrapping and populating specialized domain ontologies by organizing and mining a set of relevant overlapping Web sites prov...
Hasan Davulcu, Srinivas Vadrevu, Saravanakumar Nag...