Sciweavers

301 search results - page 50 / 61
» Metrics for Mining Multisets
Sort
View
SIGMOD
2004
ACM
184views Database» more  SIGMOD 2004»
14 years 8 months ago
Identifying Similarities, Periodicities and Bursts for Online Search Queries
We present several methods for mining knowledge from the query logs of the MSN search engine. Using the query logs, we build a time series for each query word or phrase (e.g., `Th...
Michail Vlachos, Christopher Meek, Zografoula Vage...
SDM
2009
SIAM
225views Data Mining» more  SDM 2009»
14 years 5 months ago
Integrated KL (K-means - Laplacian) Clustering: A New Clustering Approach by Combining Attribute Data and Pairwise Relations.
Most datasets in real applications come in from multiple sources. As a result, we often have attributes information about data objects and various pairwise relations (similarity) ...
Fei Wang, Chris H. Q. Ding, Tao Li
SDM
2009
SIAM
202views Data Mining» more  SDM 2009»
14 years 5 months ago
Proximity-Based Anomaly Detection Using Sparse Structure Learning.
We consider the task of performing anomaly detection in highly noisy multivariate data. In many applications involving real-valued time-series data, such as physical sensor data a...
Tsuyoshi Idé, Aurelie C. Lozano, Naoki Abe,...
SAC
2010
ACM
14 years 2 months ago
Feature selection for ordinal regression
Ordinal regression (also known as ordinal classification) is a supervised learning task that consists of automatically determining the implied rating of a data item on a fixed, ...
Stefano Baccianella, Andrea Esuli, Fabrizio Sebast...
SAC
2010
ACM
14 years 2 months ago
Estimating node similarity from co-citation in a spatial graph model
Co-citation (number of nodes linking to both of a given pair of nodes) is often used heuristically to judge similarity between nodes in a complex network. We investigate the relat...
Jeannette Janssen, Pawel Pralat, Rory Wilson