Sciweavers

563 search results - page 48 / 113
» Crawling the web for structured documents
Sort
View
DOCENG
2008
ACM
15 years 5 months ago
Identifying and expanding titles in web texts
In this paper, we present an analysis based on linguistic and typographic features that allows for the identification of titles in web documents. We focus in particular on procedu...
Clémentine Adam, Estelle Delpech, Patrick S...
SEMWIKI
2008
134views Data Mining» more  SEMWIKI 2008»
15 years 5 months ago
A Real Semantic Web for Mathematics Deserves a Real Semantics
Abstract. Mathematical documents, and their instrumentation by computers, have rich structure at the layers of presentation, metadata and semantics, as objects in a system for form...
Cezary Kaliszyk, Pierre Corbineau, Freek Wiedijk, ...
LWA
2008
15 years 5 months ago
Labeling Clusters - Tagging Resources
In order to support the navigation in huge document collections efficiently, tagged hierarchical structures can be used. Often, multiple tags are used to describe resources. For u...
Korinna Bade, Andreas Nürnberger
CN
1998
207views more  CN 1998»
15 years 3 months ago
The Anatomy of a Large-Scale Hypertextual Web Search Engine
In this paper, we present Google, a prototype of a large-scale search engine which makes heavy use of the structure present in hypertext. Google is designed to crawl and index the...
Sergey Brin, Lawrence Page
APLAS
2004
ACM
15 years 9 months ago
An Algebraic Approach to Bi-directional Updating
In many occasions would one encounter the task of maintaining the consistency of two pieces of structured data that are related by some transform — synchronising bookmarks in diï...
Shin-Cheng Mu, Zhenjiang Hu, Masato Takeichi