Sciweavers

» Extraction of Structural Information from the Web
CIKM
2005
Springer
15 years 10 months ago
Structural features in content oriented XML retrieval
The structural features of XML components are an extra source of information that should be used in a content-oriented retrieval task on this type of document. This paper explores...
Georgina Ramírez, Thijs Westerveld, Arjen P...
WWW
2008
ACM
16 years 5 months ago
As we may perceive: finding the boundaries of compound documents on the web
This paper considers the problem of identifying compound documents (cDocs) on the Web: groups of web pages that in aggregate constitute semantically coherent information entities...
Pavel Dmitriev
ADL
1997
Springer
15 years 8 months ago
Error Tolerant Document Structure Analysis
Successful applications of digital libraries require structured access to sources of information. This paper presents an approach to extract the logical structure of text document...
Bertin Klein, Peter Fankhauser
WWW
2001
ACM
16 years 5 months ago
Mixed-initiative, multi-source information assistants
While the information resources on the Web are vast, the sources are often hard to find, painful to use, and difficult to integrate. We have developed the Heracles framework for b...
Craig A. Knoblock, Steven Minton, José Luis...
CN
1998
15 years 4 months ago
WebL - A Programming Language for the Web
In this paper we introduce a programming language for Web document processing called WebL. WebL is a high level, object-oriented scripting language that incorporates two novel fea...
Thomas Kistler, Hannes Marais