Sciweavers

960 search results - page 146 / 192
» CURE: An Efficient Clustering Algorithm for Large Databases
Sort
View
125
Voted
CIKM
2008
Springer
15 years 5 months ago
Scaling up duplicate detection in graph data
Duplicate detection determines different representations of realworld objects in a database. Recent research has considered the use of relationships among object representations t...
Melanie Herschel, Felix Naumann
159
Voted
ICCV
2007
IEEE
16 years 5 months ago
Interactive Search for Image Categories by Mental Matching
Traditional image retrieval methods require a "query image" to initiate a search for members of an image category. However, when the image database is unstructured, and ...
Marin Ferecatu, Donald Geman
117
Voted
SC
2005
ACM
15 years 9 months ago
Optimized Data Loading for a Multi-Terabyte Sky Survey Repository
Advanced instruments in a variety of scientific domains are collecting massive amounts of data that must be postprocessed and organized to support research activities. Astronomers...
Y. Dora Cai, Ruth A. Aydt, Robert Brunner
256
Voted
ICDE
2008
IEEE
203views Database» more  ICDE 2008»
16 years 5 months ago
Training Linear Discriminant Analysis in Linear Time
Linear Discriminant Analysis (LDA) has been a popular method for extracting features which preserve class separability. It has been widely used in many fields of information proces...
Deng Cai, Xiaofei He, Jiawei Han
145
Voted
SIGMOD
2003
ACM
145views Database» more  SIGMOD 2003»
16 years 3 months ago
Containment Join Size Estimation: Models and Methods
Recent years witnessed an increasing interest in researches in XML, partly due to the fact that XML has now become the de facto standard for data interchange over the internet. A ...
Wei Wang 0011, Haifeng Jiang, Hongjun Lu, Jeffrey ...