Search Sciweavers | Sciweavers

32 search results - page 4 / 7

» Near-duplicate detection for web-forums

WWW
2008
ACM

Internet Technology

Social and semantics analysis via non-negative matrix factorization

Download www2008.org

Social media such as Web forums often have dense interactions between users and content, for which network models are often appropriate for analysis. Joint non-negative matrix factorizat...

Zhi-Li Wu, Chi-Wa Cheng, Chun-hung Li

CIVR
2007
Springer

Image Analysis

Scalable near identical image and shot detection

Download cmp.felk.cvut.cz

This paper proposes and compares two novel schemes for near duplicate image and video-shot detection. The first approach is based on global hierarchical colour histograms, using ...

Ondrej Chum, James Philbin, Michael Isard, Andrew ...

DIS
2007
Springer

Theoretical Computer Science

Unsupervised Spam Detection Based on String Alienness Measures

Download www.i.kyushu-u.ac.jp

We propose an unsupervised method for detecting spam documents from Web page data, based on equivalence relations on strings. We propose 3 measures for quantifying the alienness (...

Kazuyuki Narisawa, Hideo Bannai, Kohei Hatano, Mas...

KDD
2004
ACM

Data Mining

Improved robustness of signature-based near-replica detection via lexicon randomization

Download ir.iit.edu

Detection of near duplicate documents is an important problem in many data mining and information filtering applications. When faced with massive quantities of data, traditional d...

Aleksander Kolcz, Abdur Chowdhury, Joshua Alspecto...

MM
2009
ACM

Multimedia

MyFinder: near-duplicate detection for large image collections

Download www.uweb.ucsb.edu

The explosive growth of multimedia data poses serious challenges to data storage, management and search. Efficient near-duplicate detection is one of the required technologies for...

Xin Yang, Qiang Zhu, Kwang-Ting Cheng
