Sciweavers

» Robust web content extraction
AI
2005
Springer
13 years 8 months ago
Integrating Web Content Clustering into Web Log Association Rule Mining
Abstract. One of the effects of the general Internet growth is an immense number of user accesses to WWW resources. These accesses are recorded in the web server log files, which...
Jiayun Guo, Vlado Keselj, Qigang Gao
KDD
2002
ACM
14 years 7 months ago
Discovering informative content blocks from Web documents
In this paper, we propose a new approach to discover informative contents from a set of tabular documents (or Web pages) of a Web site. Our system, InfoDiscoverer, first partition...
Shian-Hua Lin, Jan-Ming Ho
WIRI
2005
IEEE
14 years 9 days ago
Extended Link Analysis for Extracting Spatial Information Hubs
Recently, web mining, which tries to find useful knowledge in the vast number of web pages, has attracted a lot of research interest. Besides, it is becoming an essential task to...
Jianwei Zhang 0002, Yoshiharu Ishikawa, Hiroyuki K...
WWW
2005
ACM
14 years 8 days ago
An information extraction engine for web discussion forums
In this poster, we present an information extraction engine for web-based forums. The engine analyzes the HTML files crawled from web forums, deduces the wrapper (template) of the...
Hanny Yulius Limanto, Nguyen Ngoc Giang, Vo Tan Tr...
ICPR
2000
IEEE
14 years 7 months ago
Robust Extraction of Text in Video
Despite advances in the archiving of digital video, we are still unable to efficiently search and retrieve the portions that interest us. Video indexing by shot segmentation has b...
Sameer Antani, David J. Crandall, Rangachar Kastur...