Sciweavers

309 search results
» Discovering informative content blocks from Web documents
AUSAI
2003
Springer
14 years 21 days ago
Semi-Automatic Construction of Metadata from a Series of Web Documents
Metadata plays an important role in discovering, collecting, extracting and aggregating Web data. This paper proposes a method of constructing metadata for a specific topic. The m...
Sachio Hirokawa, Eisuke Itoh, Tetsuhiro Miyahara
DEXAW
2010
IEEE
13 years 8 months ago
Towards a Search System for the Web Exploiting Spatial Data of a Web Document
In this paper, we describe our work in progress in the scope of information retrieval exploiting the spatial data extracted from web documents. We discuss problems of a search for ...
Stefan Dlugolinsky, Michal Laclavik, Ladislav Hluc...
WIRI
2005
IEEE
14 years 1 month ago
Postal Address Detection from Web Documents
An approach to postal address detection from webpages is proposed. The webpages are first segmented into text blocks based on their visual similarity. The text content in each bl...
Lin Can, Zhang Qian, Xiaofeng Meng, Wenyin Lin
ACL
2006
13 years 8 months ago
Examining the Content Load of Part of Speech Blocks for Information Retrieval
We investigate the connection between part of speech (POS) distribution and content in language. We define POS blocks to be groups of parts of speech. We hypothesise that there ex...
Christina Lioma, Iadh Ounis
WIDM
1998
ACM
13 years 11 months ago
WebML: Querying the World-Wide Web for Resources and Knowledge
There is a massive increase of information available on electronic networks. This profusion of resources on the World-Wide Web gave rise to considerable interest in the research co...
Osmar R. Zaïane, Jiawei Han