Sciweavers

1749 search results - page 78 / 350
» An ICA algorithm for analyzing multiple data sets
Sort
View
SIGMOD
2010
ACM
174views Database» more  SIGMOD 2010»
14 years 2 months ago
Sampling dirty data for matching attributes
We investigate the problem of creating and analyzing samples of relational databases to find relationships between string-valued attributes. Our focus is on identifying attribute...
Henning Köhler, Xiaofang Zhou, Shazia Wasim S...
DASFAA
2004
IEEE
134views Database» more  DASFAA 2004»
14 years 1 months ago
Applying Co-training to Clickthrough Data for Search Engine Adaptation
The information on the World Wide Web is growing without bound. Users may have very diversified preferences in the pages they target through a search engine. It is therefore a chal...
Qingzhao Tan, Xiaoyong Chai, Wilfred Ng, Dik Lun L...
DMKD
2003
ACM
96views Data Mining» more  DMKD 2003»
14 years 2 months ago
Using transposition for pattern discovery from microarray data
We analyze expression matrices to identify a priori interesting sets of genes, e.g., genes that are frequently co-regulated. Such matrices provide expression values for given biol...
François Rioult, Jean-François Bouli...
PODC
2010
ACM
13 years 11 months ago
Distributed data classification in sensor networks
Low overhead analysis of large distributed data sets is necessary for current data centers and for future sensor networks. In such systems, each node holds some data value, e.g., ...
Ittay Eyal, Idit Keidar, Raphael Rom
RECOMB
2008
Springer
14 years 9 months ago
CompostBin: A DNA Composition-Based Algorithm for Binning Environmental Shotgun Reads
A major hindrance to studies of microbial diversity has been that the vast majority of microbes cannot be cultured in the laboratory and thus are not amenable to traditional method...
Sourav Chatterji, Ichitaro Yamazaki, Zhaojun Bai, ...