Data Mining | Sciweavers

209

Voted

KDD
2008
ACM

120views Data Mining» more KDD 2008»

Entity categorization over large document collections

16 years 7 months ago

Extracting entities (such as people, movies) from documents and identifying the categories (such as painter, writer) they belong to enable structured querying and data analysis ov...

Arnd Christian König, Rares Vernica, Venkates...

claim paper

Read More »

236

click to vote

KDD
2008
ACM

257views Data Mining» more KDD 2008»

Knowledge discovery of semantic relationships between words using nonparametric bayesian graph model

16 years 7 months ago

Download www.r.dl.itc.u-tokyo.ac.jp

We developed a model based on nonparametric Bayesian modeling for automatic discovery of semantic relationships between words taken from a corpus. It is aimed at discovering seman...

Issei Sato, Minoru Yoshida, Hiroshi Nakagawa

claim paper

Read More »

215

click to vote

KDD
2008
ACM

206views Data Mining» more KDD 2008»

Identifying biologically relevant genes via multiple heterogeneous data sources

16 years 7 months ago

Download www.public.asu.edu

Selection of genes that are differentially expressed and critical to a particular biological process has been a major challenge in post-array analysis. Recent development in bioin...

Zheng Zhao, Jiangxin Wang, Huan Liu, Jieping Ye, Y...

claim paper

Read More »

202

Voted

KDD
2008
ACM

156views Data Mining» more KDD 2008»

Can complex network metrics predict the behavior of NBA teams?

16 years 7 months ago

Download www.cs.unc.edu

The United States National Basketball Association (NBA) is one of the most popular sports league in the world and is well known for moving a millionary betting market that uses th...

Antonio Alfredo Ferreira Loureiro, Pedro O. S. Vaz...

claim paper

Read More »

176

click to vote

KDD
2008
ACM

146views Data Mining» more KDD 2008»

Constraint programming for itemset mining

16 years 7 months ago

Download www.cs.kuleuven.be

The relationship between constraint-based mining and constraint programming is explored by showing how the typical constraints used in pattern mining can be formulated for use in ...

Luc De Raedt, Tias Guns, Siegfried Nijssen

claim paper

Read More »

206

click to vote

KDD
2008
ACM

234views Data Mining» more KDD 2008»

Angle-based outlier detection in high-dimensional data

16 years 7 months ago

Download www.dbs.informatik.uni-muenchen.de

Detecting outliers in a large set of data objects is a major data mining task aiming at finding different mechanisms responsible for different groups of objects in a data set. All...

Hans-Peter Kriegel, Matthias Schubert, Arthur Zime...

claim paper

Read More »

215

click to vote

KDD
2008
ACM

183views Data Mining» more KDD 2008»

De-duping URLs via rewrite rules

16 years 7 months ago

Download research.yahoo.com

A large fraction of the URLs on the web contain duplicate (or near-duplicate) content. De-duping URLs is an extremely important problem for search engines, since all the principal...

Anirban Dasgupta, Ravi Kumar, Amit Sasturkar

claim paper

Read More »

211

Voted

KDD
2008
ACM

140views Data Mining» more KDD 2008»

Semi-supervised approach to rapid and reliable labeling of large data sets

16 years 7 months ago

Download www-users.cs.umn.edu

Supervised classification methods have been shown to be very effective for a large number of applications. They require a training data set whose instances are labeled to indicate...

György J. Simon, Vipin Kumar, Zhi-Li Zhang

claim paper

Read More »

184

click to vote

KDD
2008
ACM

167views Data Mining» more KDD 2008»

A sequential dual method for large scale multi-class linear svms

16 years 7 months ago

Download www.csie.ntu.edu.tw

Efficient training of direct multi-class formulations of linear Support Vector Machines is very useful in applications such as text classification with a huge number examples as w...

S. Sathiya Keerthi, S. Sundararajan, Kai-Wei Chang...

claim paper

Read More »

244

click to vote

KDD
2008
ACM

192views Data Mining» more KDD 2008»

Partial least squares regression for graph mining

16 years 7 months ago

Download eprints.pascal-network.org

Attributed graphs are increasingly more common in many application domains such as chemistry, biology and text processing. A central issue in graph mining is how to collect inform...

Hiroto Saigo, Koji Tsuda, Nicole Krämer

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers