Search Sciweavers | Sciweavers

SIGIR
2010
ACM

169 views, Information Technology

Efficient partial-duplicate detection based on sequence matching

15 years 2 months ago

With the ever-increasing growth of the Internet, numerous copies of documents have become a serious problem for search engines, opinion mining, and many other web applications. Since parti...

Qi Zhang, Yue Zhang, Haomin Yu, Xuanjing Huang

SIGMOD
2008
ACM

157 views, Database

CRD: fast co-clustering on large datasets utilizing sampling-based matrix decomposition

16 years 7 months ago

Download compgen.unc.edu

The problem of simultaneously clustering columns and rows (coclustering) arises in important applications, such as text data mining, microarray analysis, and recommendation system...

Feng Pan, Xiang Zhang, Wei Wang 0010

JCDL
2004
ACM

198 views, Education

Finding authoritative people from the web

16 years 26 days ago

Download www.ingrid.org

Today’s web is so huge and diverse that it arguably reflects the real world. For this reason, searching the web is a promising approach to find things in the real world. This ...

Masanori Harada, Shin-ya Sato, Kazuhiro Kazama

KDD
2007
ACM

237 views, Data Mining

Knowledge discovery of multiple-topic document using parametric mixture model with dirichlet prior

16 years 7 months ago

Download www.r.dl.itc.u-tokyo.ac.jp

Documents, such as those seen on Wikipedia and Folksonomy, tend to be assigned multiple topics as metadata. Therefore, it is increasingly important to analyze a re...

Issei Sato, Hiroshi Nakagawa

KDD
2010
ACM

247 views, Data Mining

Active learning for biomedical citation screening

15 years 9 months ago

Download tuftscaes.org

Active learning (AL) is an increasingly popular strategy for mitigating the amount of labeled data required to train classifiers, thereby reducing annotator effort. We describe ...

Byron C. Wallace, Kevin Small, Carla E. Brodley, T...
