Sciweavers

KDD
1998
ACM
140views Data Mining» more  KDD 1998»
14 years 26 days ago
Blurring the Distinction between Command and Data in Scientific KDD
We have been working on two different KDD systems for scientific data. One system involves comparative genomics, where the database contains more than 60,000 plant gene and protei...
John V. Carlis, Elizabeth Shoop, Scott Krieger
KDD
1998
ACM
84views Data Mining» more  KDD 1998»
14 years 26 days ago
Similarity of Attributes by External Probes
In data mining, similarity or distance between attributes is one of the central notions. Such a notion can be used to build attribute hierarchies etc. Similarity metrics can be us...
Gautam Das, Heikki Mannila, Pirjo Ronkainen
KDD
1998
ACM
141views Data Mining» more  KDD 1998»
14 years 26 days ago
Rule Discovery from Time Series
We consider the problem of nding rules relating patterns in a time series to other patterns in that series, or patterns in one series to patterns in another series. A simple examp...
Gautam Das, King-Ip Lin, Heikki Mannila, Gopal Ren...
KDD
1998
ACM
107views Data Mining» more  KDD 1998»
14 years 26 days ago
Giga-Mining
Wedescribe an industrial-strength data mining application in telecommunications.Theapplication requires building a short (7 byte) profile for all telephonenumbersseen on a large t...
Corinna Cortes, Daryl Pregibon
KDD
1998
ACM
102views Data Mining» more  KDD 1998»
14 years 26 days ago
Joins that Generalize: Text Classification Using WHIRL
WHIRL is an extensionof relational databasesthat canperform "soft joins" basedon the similarity of textual identifiers;thesesoftjoins extendthe traditional operationof j...
William W. Cohen, Haym Hirsh
KDD
1998
ACM
123views Data Mining» more  KDD 1998»
14 years 26 days ago
Scaling Clustering Algorithms to Large Databases
Practical clustering algorithms require multiple data scans to achieve convergence. For large databases, these scans become prohibitively expensive. We present a scalable clusteri...
Paul S. Bradley, Usama M. Fayyad, Cory Reina
KDD
1998
ACM
228views Data Mining» more  KDD 1998»
14 years 26 days ago
Direct Marketing Response Models Using Genetic Algorithms
Direct marketing response models seek to identify individuals most likely to respond to marketing solicitations. Such models are commonly evaluatedon classification accuracyand so...
Siddhartha Bhattacharyya
KDD
1998
ACM
103views Data Mining» more  KDD 1998»
14 years 26 days ago
CLOUDS: A Decision Tree Classifier for Large Datasets
Khaled Alsabti, Sanjay Ranka, Vineet Singh
KDD
1998
ACM
94views Data Mining» more  KDD 1998»
14 years 26 days ago
Independence Diagrams: A Technique for Visual Data Mining
An important issue in data mining is the recognition of complex dependencies between attributes. Past techniques for identifying attribute dependence include correlation coefficie...
Stefan Berchtold, H. V. Jagadish, Kenneth A. Ross
KDD
1998
ACM
146views Data Mining» more  KDD 1998»
14 years 26 days ago
Mining Association Rules in Hypertext Databases
In this workweproposea generalisation of the notion of associationrule in the contextof flat transactions to that of a compositeassociation rule in the context of a structured dir...
José Borges, Mark Levene