Sciweavers

2277 search results - page 12 / 456
» Clustering by pattern similarity in large data sets
Sort
View
ICDM
2002
IEEE
159views Data Mining» more  ICDM 2002»
14 years 15 days ago
O-Cluster: Scalable Clustering of Large High Dimensional Data Sets
Clustering large data sets of high dimensionality has always been a serious challenge for clustering algorithms. Many recently developed clustering algorithms have attempted to ad...
Boriana L. Milenova, Marcos M. Campos
WSDM
2009
ACM
198views Data Mining» more  WSDM 2009»
14 years 2 months ago
Measuring the similarity between implicit semantic relations using web search engines
Measuring the similarity between implicit semantic relations is an important task in information retrieval and natural language processing. For example, consider the situation whe...
Danushka Bollegala, Yutaka Matsuo, Mitsuru Ishizuk...
BMCBI
2007
265views more  BMCBI 2007»
13 years 7 months ago
Large scale clustering of protein sequences with FORCE -A layout based heuristic for weighted cluster editing
Background: Detecting groups of functionally related proteins from their amino acid sequence alone has been a long-standing challenge in computational genome research. Several clu...
Tobias Wittkop, Jan Baumbach, Francisco P. Lobo, S...
ICDM
2003
IEEE
138views Data Mining» more  ICDM 2003»
14 years 25 days ago
PixelMaps: A New Visual Data Mining Approach for Analyzing Large Spatial Data Sets
PixelMaps are a new pixel-oriented visual data mining technique for large spatial datasets. They combine kerneldensity-based clustering with pixel-oriented displays to emphasize c...
Daniel A. Keim, Christian Panse, Mike Sips, Stephe...
WSDM
2010
ACM
204views Data Mining» more  WSDM 2010»
14 years 2 months ago
Learning URL patterns for webpage de-duplication
Presence of duplicate documents in the World Wide Web adversely affects crawling, indexing and relevance, which are the core building blocks of web search. In this paper, we pres...
Hema Swetha Koppula, Krishna P. Leela, Amit Agarwa...