Sciweavers

131 search results - page 5 / 27
» Ranking-Constrained Keyword Sequence Extraction from Web Doc...
Sort
View
CHI
1996
ACM
13 years 10 months ago
Silk from a Sow's Ear: Extracting Usable Structures from the Web
In its current implementation, the World-Wide Web lacks much of the explicit structure and strong typing found in many closed hypertext systems. While this property has directly f...
Peter Pirolli, James E. Pitkow, Ramana Rao
CAISE
2003
Springer
13 years 12 months ago
Extending an on-line information site with accurate domain-dependent extracts from the World Wide Web
This paper describes a new procedure that has been developed for extending an existing on-line information system about The Voyages of the Beagle with information collected automat...
Enrique Alfonseca, Pilar Rodríguez
KDD
2002
ACM
148views Data Mining» more  KDD 2002»
14 years 7 months ago
Discovering informative content blocks from Web documents
In this paper, we propose a new approach to discover informative contents from a set of tabular documents (or Web pages) of a Web site. Our system, InfoDiscoverer, first partition...
Shian-Hua Lin, Jan-Ming Ho
AUSAI
2003
Springer
13 years 12 months ago
Semi-Automatic Construction of Metadata from a Series of Web Documents
Metadata plays an important role in discovering, collecting, extracting and aggregating Web data. This paper proposes a method of constructing metadata for a specific topic. The m...
Sachio Hirokawa, Eisuke Itoh, Tetsuhiro Miyahara
TMM
2002
140views more  TMM 2002»
13 years 6 months ago
Narrowing the semantic gap - improved text-based web document retrieval using visual features
In this paper, we present the results of our work that seek to negotiate the gap between low-level features and high-level concepts in the domain of web document retrieval. This wo...
Rong Zhao, William I. Grosky