Sciweavers

96 search results - page 7 / 20
» Detecting Near-replicas on the Web by Content and Hyperlink ...
Sort
View
IAT
2006
IEEE
14 years 1 months ago
A Web-Based System to Monitor the Quality of Meta-Data in Web Portals
We present a web-based system to monitor the quality of the meta-data used to describe content in web portals. The system implements meta-data analysis using statistical, visualiz...
Marcos Aurélio Domingues, Carlos Soares, Al...
SAC
2006
ACM
14 years 1 months ago
Template detection for large scale search engines
Templates in web sites hurt search engine retrieval performance, especially in content relevance and link analysis. Current template removal methods suffer from processing speed ...
Liang Chen, Shaozhi Ye, Xing Li
WWW
2008
ACM
14 years 8 months ago
Social and semantics analysis via non-negative matrix factorization
Social media such as Web forum often have dense interactions between user and content where network models are often appropriate for analysis. Joint non-negative matrix factorizat...
Zhi-Li Wu, Chi-Wa Cheng, Chun-hung Li
WWW
2011
ACM
13 years 2 months ago
Prophiler: a fast filter for the large-scale detection of malicious web pages
Malicious web pages that host drive-by-download exploits have become a popular means for compromising hosts on the Internet and, subsequently, for creating large-scale botnets. In...
Davide Canali, Marco Cova, Giovanni Vigna, Christo...
WWW
2006
ACM
14 years 8 months ago
An audio/video analysis mechanism for web indexing
The high availability of video streams is making necessary mechanisms for indexing such contents in the Web world. In this paper we focus on news programs and we propose a mechani...
Marco Furini, Marco Aragone