Sciweavers

ICDM
2010
IEEE
13 years 5 months ago
S4: Distributed Stream Computing Platform
S4 is a general-purpose, distributed, scalable, partially fault-tolerant, pluggable platform that allows programmers to easily develop applications for processing continu...
Leonardo Neumeyer, Bruce Robbins, Anish Nair, Anan...
ICDM
2010
IEEE
13 years 5 months ago
Bayesian Maximum Margin Clustering
Most well-known discriminative clustering models, such as spectral clustering (SC) and maximum margin clustering (MMC), are non-Bayesian. Moreover, they merely considered...
Bo Dai, Baogang Hu, Gang Niu
ICDM
2010
IEEE
13 years 5 months ago
Active Learning from Multiple Noisy Labelers with Varied Costs
In active learning, where a learning algorithm has to purchase the labels of its training examples, it is often assumed that there is only one labeler available to label examples, ...
Yaling Zheng, Stephen D. Scott, Kun Deng
ICDM
2010
IEEE
13 years 5 months ago
gSkeletonClu: Density-Based Network Clustering via Structure-Connected Tree Division or Agglomeration
Community detection is an important task for mining the structure and function of complex networks. Many previous approaches are difficult to detect communities with arbitrary size...
Heli Sun, Jianbin Huang, Jiawei Han, Hongbo Deng, ...
ICDM
2010
IEEE
13 years 5 months ago
Content-Based Methods for Predicting Web-Site Demographic Attributes
Demographic information plays an important role in gaining valuable insights about a web-site's user-base and is used extensively to target online advertisements and promotion...
Santosh Kabbur, Eui-Hong Han, George Karypis
ICDM
2009
IEEE
13 years 6 months ago
Scalable Attribute-Value Extraction from Semi-structured Text
Yuk Wah Wong, Dominic Widdows, Tom Lokovic, Kamal ...
ICDM
2009
IEEE
13 years 6 months ago
Link Prediction on Evolving Data Using Matrix and Tensor Factorizations
The data in many disciplines such as social networks, web analysis, etc. is link-based, and the link structure can be exploited for many different data mining tasks. In t...
Evrim Acar, Daniel M. Dunlavy, Tamara G. Kolda
ICDM
2009
IEEE
13 years 6 months ago
Improved Multi Label Classification in Hierarchical Taxonomies
Hierarchical taxonomies are used to organize and retrieve information in many domains, especially those dealing with large and rapidly growing amounts of information. In many of t...
Kunal Punera, Suju Rajan
ICDM
2009
IEEE
13 years 6 months ago
Hybrid Clustering by Integrating Text and Citation Based Graphs in Journal Database Analysis
We propose a hybrid clustering strategy by integrating heterogeneous information sources as graphs. The hybrid clustering method is extended on the basis of modularity based Louva...
Xinhai Liu, Shi Yu, Yves Moreau, Frizo A. L. Janss...