Sciweavers

» Clustering Categorical Data
CIDM
2007
IEEE
14 years 4 months ago
Scalable Clustering for Large High-Dimensional Data Based on Data Summarization
Clustering large data sets with high dimensionality is a challenging data-mining task. This paper presents a framework to perform such a task efficiently. It is based on the notio...
Ying Lai, Ratko Orlandic, Wai Gen Yee, Sachin Kulk...
CIKM
2011
ACM
12 years 10 months ago
LogSig: generating system events from raw textual logs
Modern computing systems generate large amounts of log data. System administrators or domain experts utilize the log data to understand and optimize system behaviors. Most system ...
Liang Tang, Tao Li, Chang-Shing Perng
WSDM
2010
ACM
14 years 7 months ago
Ranking with Query-Dependent Loss for Web Search
Queries describe the users' search intent and therefore they play an essential role in the context of ranking for information retrieval and Web search. However, most of exist...
Jiang Bian, Tie-Yan Liu, Tao Qin, Hongyuan Zha
FUZZIEEE
2007
IEEE
14 years 4 months ago
Prototype-less Fuzzy Clustering
In contrast to standard fuzzy clustering, which optimizes a set of prototypes, one for each cluster, this paper studies fuzzy clustering without prototypes. Starting fro...
Christian Borgelt
ICML
2010
Omnipress
13 years 10 months ago
Nonparametric Information Theoretic Clustering Algorithm
In this paper we propose a novel clustering algorithm based on maximizing the mutual information between data points and clusters. Unlike previous methods, we neither assume the d...
Lev Faivishevsky, Jacob Goldberger