Sciweavers

127 search results - page 23 / 26
» Approximately Mining Recently Representative Patterns on Dat...
Sort
View
KDD
2000
ACM
149views Data Mining» more  KDD 2000»
13 years 10 months ago
Efficient clustering of high-dimensional data sets with application to reference matching
Many important problems involve clustering large datasets. Although naive implementations of clustering are computationally expensive, there are established efficient techniques f...
Andrew McCallum, Kamal Nigam, Lyle H. Ungar
KDD
2003
ACM
122views Data Mining» more  KDD 2003»
14 years 7 months ago
Discovery of climate indices using clustering
To analyze the effect of the oceans and atmosphere on land climate, Earth Scientists have developed climate indices, which are time series that summarize the behavior of selected ...
Michael Steinbach, Pang-Ning Tan, Vipin Kumar, Ste...
SIGMOD
2003
ACM
121views Database» more  SIGMOD 2003»
14 years 7 months ago
An environmental sensor network to determine drinking water quality and security
Finding patterns in large, real, spatio/temporal data continues to attract high interest (e.g., sales of products over space and time, patterns in mobile phone users; sensor netwo...
Anastassia Ailamaki, Christos Faloutsos, Paul S. F...
BIBM
2010
IEEE
151views Bioinformatics» more  BIBM 2010»
13 years 4 months ago
Probabilistic topic modeling for genomic data interpretation
Recently, the concept of a species containing both core and distributed genes, known as the supra- or pangenome theory, has been introduced. In this paper, we aim to develop a new ...
Xin Chen, Xiaohua Hu, Xiajiong Shen, Gail Rosen
ICDM
2009
IEEE
141views Data Mining» more  ICDM 2009»
14 years 1 months ago
Scalable Algorithms for Distribution Search
Distribution data naturally arise in countless domains, such as meteorology, biology, geology, industry and economics. However, relatively little attention has been paid to data m...
Yasuko Matsubara, Yasushi Sakurai, Masatoshi Yoshi...