Sciweavers

118 search results - page 13 / 24
» Methods for Linking and Mining Massive Heterogeneous Databas...
Sort
View
PPOPP
2010
ACM
14 years 8 months ago
A distributed placement service for graph-structured and tree-structured data
Effective data placement strategies can enhance the performance of data-intensive applications implemented on high end computing clusters. Such strategies can have a significant i...
Gregory Buehrer, Srinivasan Parthasarathy, Shirish...
VLDB
2005
ACM
196views Database» more  VLDB 2005»
14 years 4 months ago
Summarizing and Mining Inverse Distributions on Data Streams via Dynamic Inverse Sampling
Emerging data stream management systems approach the challenge of massive data distributions which arrive at high speeds while there is only small storage by summarizing and minin...
Graham Cormode, S. Muthukrishnan, Irina Rozenbaum
ICDM
2009
IEEE
121views Data Mining» more  ICDM 2009»
14 years 5 months ago
Finding Time Series Motifs in Disk-Resident Data
—Time series motifs are sets of very similar subsequences of a long time series. They are of interest in their own right, and are also used as inputs in several higher-level data...
Abdullah Mueen, Eamonn J. Keogh, Nima Bigdely Sham...
KDD
2007
ACM
154views Data Mining» more  KDD 2007»
14 years 11 months ago
Canonicalization of database records using adaptive similarity measures
It is becoming increasingly common to construct databases from information automatically culled from many heterogeneous sources. For example, a research publication database can b...
Aron Culotta, Michael L. Wick, Robert Hall, Matthe...
BMCBI
2006
107views more  BMCBI 2006»
13 years 10 months ago
CROPPER: a metagene creator resource for cross-platform and cross-species compendium studies
Background: Current genomic research methods provide researchers with enormous amounts of data. Combining data from different high-throughput research technologies commonly availa...
Jussi Paananen, Markus Storvik, Garry Wong