Sciweavers

129 search results - page 9 / 26
» Combining content extraction heuristics: the CombinE system
Sort
View
AI
2005
Springer
13 years 9 months ago
Integrating Web Content Clustering into Web Log Association Rule Mining
Abstract. One of the effects of the general Internet growth is an immense number of user accesses to WWW resources. These accesses are recorded in the web server log files, which...
Jiayun Guo, Vlado Keselj, Qigang Gao
ICIP
1999
IEEE
14 years 9 months ago
Water-Filling: A Novel Way for Image Structural Feature Extraction
The performance of a content based image retrieval (CBIR) system is inherently constrained by the features adopted to represent the images in the database. In this paper, a new ap...
Xiang Sean Zhou, Yong Rui, Thomas S. Huang
ACL
2009
13 years 5 months ago
Annotating and Recognising Named Entities in Clinical Notes
This paper presents ongoing research in clinical information extraction. This work introduces a new genre of text which are not well-written, noise prone, ungrammatical and with m...
Yefeng Wang
TREC
2007
13 years 8 months ago
NLPR in TREC 2007 Blog Track
This paper describes the opinion retrieval system for TREC 2007 blog track. This paper focuses on two components of the system. One component is important content block detection ...
Kang Liu, Gen Wang, Xianpei Han, Jun Zhao
WWW
2008
ACM
14 years 8 months ago
Improving web spam detection with re-extracted features
Web spam detection has become one of the top challenges for the Internet search industry. Instead of using some heuristic rules, we propose a feature re-extraction strategy to opt...
Guanggang Geng, Chunheng Wang, Qiudan Li