Sciweavers

96 search results - page 5 / 20
» Detecting Near-replicas on the Web by Content and Hyperlink ...
Sort
View
MM
2004
ACM
109views Multimedia» more  MM 2004»
14 years 1 months ago
Reading movies: an integrated DVD player for browsing movies and their scripts
We have built over the last few years an integrated browser and query interface for watching a movie synchronized with its script. The system is demonstrated with the movie ’The...
Rémi Ronfard
WWW
2004
ACM
14 years 8 months ago
Web page summarization using dynamic content
Summarizing web pages have recently gained much attention from researchers. Until now two main types of approaches have been proposed for this task: content- and context-based met...
Adam Jatowt
AIRWEB
2008
Springer
13 years 9 months ago
Exploring linguistic features for web spam detection: a preliminary study
We study the usability of linguistic features in the Web spam classification task. The features were computed on two Web spam corpora: Webspam-Uk2006 and Webspam-Uk2007, we make t...
Jakub Piskorski, Marcin Sydow, Dawid Weiss
LAWEB
2003
IEEE
14 years 28 days ago
On the Image Content of the Chilean Web
In this paper we perform a study of the image contents of the Chilean web (.cl domain) using automatic feature extraction, content-based analysis and face detection algorithms. In...
Alejandro Jaimes, Javier Ruiz-del-Solar, Rodrigo V...
WWW
2009
ACM
14 years 8 months ago
Efficient overlap and content reuse detection in blogs and online news articles
The use of blogs to track and comment on real world (political, news, entertainment) events is growing. Similarly, as more individuals start relying on the Web as their primary in...
Jong Wook Kim, Jun'ichi Tatemura, K. Selçuk...