Sciweavers

764 search results - page 119 / 153
» Using Data Mining for Wine Quality Assessment
Sort
View
ICPP
2000
IEEE
15 years 8 months ago
A Scalable Parallel Subspace Clustering Algorithm for Massive Data Sets
Clustering is a data mining problem which finds dense regions in a sparse multi-dimensional data set. The attribute values and ranges of these regions characterize the clusters. ...
Harsha S. Nagesh, Sanjay Goil, Alok N. Choudhary
WWW
2005
ACM
16 years 4 months ago
The volume and evolution of web page templates
Web pages contain a combination of unique content and template material, which is present across multiple pages and used primarily for formatting, navigation, and branding. We stu...
David Gibson, Kunal Punera, Andrew Tomkins
109
Voted
WSDM
2009
ACM
148views Data Mining» more  WSDM 2009»
15 years 10 months ago
Information arbitrage across multi-lingual Wikipedia
The rapid globalization of Wikipedia is generating a parallel, multi-lingual corpus of unprecedented scale. Pages for the same topic in many different languages emerge both as a r...
Eytan Adar, Michael Skinner, Daniel S. Weld
156
Voted
PKDD
2010
Springer
148views Data Mining» more  PKDD 2010»
15 years 2 months ago
Exploiting Causal Independence in Markov Logic Networks: Combining Undirected and Directed Models
Abstract. A new method is proposed for compiling causal independencies into Markov logic networks (MLNs). An MLN can be viewed as compactly representing a factorization of a joint ...
Sriraam Natarajan, Tushar Khot, Daniel Lowd, Prasa...
141
Voted
ICDM
2009
IEEE
145views Data Mining» more  ICDM 2009»
15 years 1 months ago
Significance of Episodes Based on Minimal Windows
Discovering episodes, frequent sets of events from a sequence has been an active field in pattern mining. Traditionally, a level-wise approach is used to discover all frequent epis...
Nikolaj Tatti