Sciweavers

2497 search results - page 218 / 500
» A Partial-Repeatability Approach to Data Mining
Sort
View
141
Voted
KDD
2007
ACM
152views Data Mining» more  KDD 2007»
16 years 5 months ago
Efficient incremental constrained clustering
Clustering with constraints is an emerging area of data mining research. However, most work assumes that the constraints are given as one large batch. In this paper we explore the...
Ian Davidson, S. S. Ravi, Martin Ester
KDD
2003
ACM
214views Data Mining» more  KDD 2003»
16 years 5 months ago
Adaptive duplicate detection using learnable string similarity measures
The problem of identifying approximately duplicate records in databases is an essential step for data cleaning and data integration processes. Most existing approaches have relied...
Mikhail Bilenko, Raymond J. Mooney
KDD
2002
ACM
119views Data Mining» more  KDD 2002»
16 years 5 months ago
On effective classification of strings with wavelets
In recent years, the technological advances in mapping genes have made it increasingly easy to store and use a wide variety of biological data. Such data are usually in the form o...
Charu C. Aggarwal
SIGMOD
2010
ACM
260views Database» more  SIGMOD 2010»
15 years 9 months ago
Towards proximity pattern mining in large graphs
Mining graph patterns in large networks is critical to a variety of applications such as malware detection and biological module discovery. However, frequent subgraphs are often i...
Arijit Khan, Xifeng Yan, Kun-Lung Wu
180
Voted
EDBT
2012
ACM
257views Database» more  EDBT 2012»
13 years 7 months ago
Indexing and mining topological patterns for drug discovery
Increased availability of large repositories of chemical compounds has created new challenges and opportunities for the application of data-mining and indexing techniques to probl...
Sayan Ranu, Ambuj K. Singh