Sciweavers

301 search results - page 40 / 61
» Metrics for Mining Multisets
Sort
View
DS
1997
117views Database» more  DS 1997»
13 years 9 months ago
Experience with a Combined Approach to Attribute-Matching Across Heterogeneous Databases
Determining attribute correspondences is a difficult, time-consuming, knowledge-intensive part of database integration. We report on experiences with tools that identified candi...
Chris Clifton, E. Housman, Arnon Rosenthal
JPDC
2007
138views more  JPDC 2007»
13 years 7 months ago
Distributed computation of the knn graph for large high-dimensional point sets
High-dimensional problems arising from robot motion planning, biology, data mining, and geographic information systems often require the computation of k nearest neighbor (knn) gr...
Erion Plaku, Lydia E. Kavraki
SAC
2009
ACM
14 years 2 months ago
Evaluating algorithms that learn from data streams
In the past years, the theory and practice of machine learning and data mining have been focused on static and finite data sets from where learning algorithms generate a static m...
João Gama, Pedro Pereira Rodrigues, Raquel ...
EUROPAR
2007
Springer
14 years 2 months ago
Are P2P Data-Dissemination Techniques Viable in Today's Data-Intensive Scientific Collaborations?
The interest among a geographically distributed user base to mine massive collections of scientific data propels the need for efficient data dissemination solutions. An optimal dat...
Samer Al-Kiswany, Matei Ripeanu, Adriana Iamnitchi...
HPCC
2005
Springer
14 years 1 months ago
A Coarse Grained Parallel Algorithm for Closest Larger Ancestors in Trees with Applications to Single Link Clustering
Hierarchical clustering methods are important in many data mining and pattern recognition tasks. In this paper we present an efficient coarse grained parallel algorithm for Single...
Albert Chan, Chunmei Gao, Andrew Rau-Chaplin