Sciweavers

IDEAS
2008
IEEE
14 years 2 months ago
Pattern based processing of XPath queries
As the popularity of areas including document storage and distributed systems continues to grow, the demand for high performance XML databases is increasingly evident. This has le...
Gerard Marks, Mark Roantree
WWW
2006
ACM
14 years 8 months ago
Using graph matching techniques to wrap data from PDF documents
Wrapping is the process of navigating a data source, semiautomatically extracting data and transforming it into a form suitable for data processing applications. There are current...
Tamir Hassan, Robert Baumgartner
AIEDU
2008
13 years 7 months ago
Automatic Extraction of Pedagogic Metadata from Learning Content
Annotating learning material with metadata allows easy reusability by different learning/tutoring systems. Several metadata standards have been developed to represent learning obje...
Devshri Roy, Sudeshna Sarkar, Sujoy Ghose
SIGIR
2003
ACM
14 years 27 days ago
XML retrieval: what to retrieve?
The fundamental difference between standard information retrieval and XML retrieval is the unit of retrieval. In traditional IR, the unit of retrieval is fixed: it is the comple...
Jaap Kamps, Maarten Marx, Maarten de Rijke, Bö...
EMNLP
2007
13 years 9 months ago
Bootstrapping Information Extraction from Field Books
We present two machine learning approaches to information extraction from semi-structured documents that can be used if no annotated training data are available, but there does ex...
Sander Canisius, Caroline Sporleder