Sciweavers

ICDM
2006
IEEE
143 views, Data Mining
14 years 2 months ago
Applying Data Mining to Pseudo-Relevance Feedback for High Performance Text Retrieval
In this paper, we investigate the use of data mining, in particular the text classification and co-training techniques, to identify more relevant passages based on a small set of...
Xiangji Huang, Yan Rui Huang, Miao Wen, Aijun An, ...
ICDM
2006
IEEE
193 views, Data Mining
14 years 2 months ago
Local Correlation Tracking in Time Series
We address the problem of capturing and tracking local correlations among time evolving time series. Our approach is based on comparing the local auto-covariance matrices (via the...
Spiros Papadimitriou, Jimeng Sun, Philip S. Yu
ICDM
2006
IEEE
122 views, Data Mining
14 years 2 months ago
Optimal Segmentation Using Tree Models
Sequence data are abundant in application areas such as computational biology, environmental sciences, and telecommunications. Many real-life sequences have a strong segmental str...
Robert Gwadera, Aristides Gionis, Heikki Mannila
ICDM
2006
IEEE
89 views, Data Mining
14 years 2 months ago
Plagiarism Detection in arXiv
We describe a large-scale application of methods for finding plagiarism and self-plagiarism in research document collections. The methods are applied to a collection of 284,834 d...
Daria Sorokina, Johannes Gehrke, Simeon Warner, Pa...
ICDM
2006
IEEE
135 views, Data Mining
14 years 2 months ago
SAXually Explicit Images: Finding Unusual Shapes
Among the visual features of multimedia content, shape is of particular interest because humans can often recognize objects solely on the basis of shape. Over the past three decad...
Li Wei, Eamonn J. Keogh, Xiaopeng Xi
ICDM
2006
IEEE
132 views, Data Mining
14 years 2 months ago
High Quality, Efficient Hierarchical Document Clustering Using Closed Interesting Itemsets
High dimensionality remains a significant challenge for document clustering. Recent approaches used frequent itemsets and closed frequent itemsets to reduce dimensionality, and to...
Hassan H. Malik, John R. Kender
ICDM
2006
IEEE
86 views, Data Mining
14 years 2 months ago
Turning Clusters into Patterns: Rectangle-Based Discriminative Data Description
The ultimate goal of data mining is to extract knowledge from massive data. Knowledge is ideally represented as human-comprehensible patterns from which end-users can gain intuiti...
Byron J. Gao, Martin Ester
ICDM
2006
IEEE
127 views, Data Mining
14 years 2 months ago
The Relationships Among Various Nonnegative Matrix Factorization Methods for Clustering
The nonnegative matrix factorization (NMF) has been shown recently to be useful for clustering. Various extensions of NMF have also been proposed. In this paper we present an over...
Tao Li, Chris H. Q. Ding
ICDE
2006
IEEE
124 views, Database
14 years 2 months ago
Systematic Approach for Optimizing Complex Mining Tasks on Multiple Databases
It has been well recognized that data mining is an interactive and iterative process. In order to support this process, one of the long-term goals of data mining research has been...
Ruoming Jin, Gagan Agrawal
ICDE
2006
IEEE
165 views, Database
14 years 2 months ago
Privacy Preserving Clustering on Horizontally Partitioned Data
Data mining has been a popular research area for more than a decade due to its vast spectrum of applications. The power of data mining tools to extract hidden information that can...
Ali Inan, Yücel Saygin, Erkay Savas, Ayç...