Sciweavers

79 search results - page 12 / 16
» IMDC: An Image-Mapped Data Clustering Technique for Large Da...
SIGMOD
2011
ACM
A platform for scalable one-pass analytics using MapReduce
Today’s one-pass analytics applications tend to be data-intensive in nature and require the ability to process high volumes of data efficiently. MapReduce is a popular programm...
Boduo Li, Edward Mazur, Yanlei Diao, Andrew McGreg...
BMCBI
2006
The effect of oligonucleotide microarray data pre-processing on the analysis of patient-cohort studies
Background: Intensity values measured by Affymetrix microarrays have to be both normalized, to be able to compare different microarrays by removing non-biological variation, and s...
Roel G. W. Verhaak, Frank J. T. Staal, Peter J. M....
BMCBI
2008
Partial mixture model for tight clustering of gene expression time-course
Background: Tight clustering arose recently from a desire to obtain tighter and potentially more informative clusters in gene expression studies. Scattered genes with relatively l...
Yinyin Yuan, Chang-Tsun Li, Roland Wilson
DATAMINE
2006
Scalable Clustering Algorithms with Balancing Constraints
Clustering methods for data-mining problems must be extremely scalable. In addition, several data mining applications demand that the clusters obtained be balanced, i.e., be of ap...
Arindam Banerjee, Joydeep Ghosh
PVLDB
2010
Entity Resolution with Evolving Rules
Entity resolution (ER) identifies database records that refer to the same real world entity. In practice, ER is not a one-time process, but is constantly improved as the data, sc...
Steven Whang, Hector Garcia-Molina