Sciweavers

630 search results - page 91 / 126
» Sameness: An Experiment in Code Search
Sort
View
KDD
2002
ACM
93views Data Mining» more  KDD 2002»
14 years 9 months ago
Interactive deduplication using active learning
Deduplication is a key operation in integrating data from multiple sources. The main challenge in this task is designing a function that can resolve when a pair of records refer t...
Sunita Sarawagi, Anuradha Bhamidipaty
SIGMOD
2004
ACM
196views Database» more  SIGMOD 2004»
14 years 9 months ago
FARMER: Finding Interesting Rule Groups in Microarray Datasets
Microarray datasets typically contain large number of columns but small number of rows. Association rules have been proved to be useful in analyzing such datasets. However, most e...
Gao Cong, Anthony K. H. Tung, Xin Xu, Feng Pan, Ji...
CHI
2010
ACM
14 years 3 months ago
The effect of audience design on labeling, organizing, and finding shared files
In an online experiment, I apply theory from psychology and communications to find out whether group information management tasks are governed by the same communication processes...
Emilee Rader
EVOW
2010
Springer
14 years 3 months ago
Towards Automatic Detecting of Overlapping Genes - Clustered BLAST Analysis of Viral Genomes
Overlapping genes (encoded on the same DNA strand but in different frames) are thought to be rare and, therefore, were largely neglected in the past. In a test set of 800 viruses ...
Klaus Neuhaus, Daniela Oelke, David Fürst, Si...
CIKM
2009
Springer
14 years 3 months ago
Improving web page classification by label-propagation over click graphs
In this paper, we present a semi-supervised learning method for web page classification, leveraging click logs to augment training data by propagating class labels to unlabeled si...
Soo-Min Kim, Patrick Pantel, Lei Duan, Scott Gaffn...