Search Sciweavers | Sciweavers

WWW
2008
ACM

214 views, Internet Technology

16 years 6 months ago

Efficient similarity joins for near duplicate detection

Download www2008.org

With the increasing amount of data and the need to integrate data from multiple data sources, a challenging issue is to find near duplicate records efficiently. In this paper, we ...

Chuan Xiao, Wei Wang 0011, Xuemin Lin, Jeffrey Xu ...

WWW
2004
ACM

120 views, Internet Technology

Graph-based text database for knowledge discovery

16 years 6 months ago

Download www.iw3c2.org

While we expect to discover knowledge in the texts available on the Web, such discovery usually requires many complex analysis steps, most of which require different text handling...

Junji Tomita, Hidekazu Nakawatase, Megumi Ishii

KDD
2009
ACM

141 views, Data Mining

Meme-tracking and the dynamics of the news cycle

16 years 6 months ago

Download www.cs.cornell.edu

Tracking new topics, ideas, and "memes" across the Web has been an issue of considerable interest. Recent work has developed methods for tracking topic shifts over long ...

Jure Leskovec, Lars Backstrom, Jon M. Kleinberg

KDD
2008
ACM

243 views, Data Mining

Permu-pattern: discovery of mutable permutation patterns with proximity constraint

16 years 6 months ago

Download making.csie.ndhu.edu.tw

Pattern discovery in sequences is an important problem in many applications, especially in computational biology and text mining. However, due to the noisy nature of data, the tra...

Meng Hu, Jiong Yang, Wei Su

HPCA
2001
IEEE

109 views, Distributed And Parallel Com...

A New Scalable Directory Architecture for Large-Scale Multiprocessors

16 years 6 months ago

Download ditec.um.es

The memory overhead introduced by directories constitutes a major hurdle in the scalability of cc-NUMA architectures, which makes the shared-memory paradigm unfeasible for very la...

Manuel E. Acacio, José González, Jos...
