Sciweavers

960 search results - page 101 / 192
» CURE: An Efficient Clustering Algorithm for Large Databases
Sort
View
KDD
1997
ACM
78views Data Mining» more  KDD 1997»
15 years 8 months ago
Mining Generalized Term Associations: Count Propagation Algorithm
We presenthere an approachand algorithm for mining generalizedterm associations.The problem is to find co-occurrencefrequenciesof terms, given a collection of documents eachwith r...
Jonghyun Kahng, Wen-Hsiang Kevin Liao, Dennis McLe...
WWW
2007
ACM
16 years 4 months ago
SPARQ2L: towards support for subgraph extraction queries in rdf databases
Many applications in analytical domains often have the need to "connect the dots" i.e., query about the structure of data. In bioinformatics for example, it is typical t...
Kemafor Anyanwu, Angela Maduko, Amit P. Sheth
SIGMOD
2009
ACM
291views Database» more  SIGMOD 2009»
16 years 4 months ago
Partial join order optimization in the paraccel analytic database
The ParAccel Analytic DatabaseTM is a fast shared-nothing parallel relational database system with a columnar orientation, adaptive compression, memory-centric design, and an enha...
Yijou Chen, Richard L. Cole, William J. McKenna, S...
SIGMOD
2007
ACM
112views Database» more  SIGMOD 2007»
16 years 4 months ago
A random walk approach to sampling hidden databases
A large part of the data on the World Wide Web is hidden behind form-like interfaces. These interfaces interact with a hidden backend database to provide answers to user queries. ...
Arjun Dasgupta, Gautam Das, Heikki Mannila
ICML
2004
IEEE
15 years 9 months ago
Active learning using pre-clustering
The paper is concerned with two-class active learning. While the common approach for collecting data in active learning is to select samples close to the classification boundary,...
Hieu Tat Nguyen, Arnold W. M. Smeulders