Sciweavers

KDD
2005
ACM
130views Data Mining» more  KDD 2005»
14 years 12 months ago
Simple and effective visual models for gene expression cancer diagnostics
In the paper we show that diagnostic classes in cancer gene expression data sets, which most often include thousands of features (genes), may be effectively separated with simple ...
Gregor Leban, Minca Mramor, Ivan Bratko, Blaz Zupa...
KDD
2005
ACM
85views Data Mining» more  KDD 2005»
14 years 12 months ago
A multiple tree algorithm for the efficient association of asteroid observations
Jeremy Kubica, Andrew W. Moore, Andrew Connolly, R...
KDD
2005
ACM
99views Data Mining» more  KDD 2005»
14 years 12 months ago
Determining an author's native language by mining a text for errors
In this paper, we show that stylistic text features can be exploited to determine an anonymous author's native language with high accuracy. Specifically, we first use automat...
Moshe Koppel, Jonathan Schler, Kfir Zigdon
KDD
2005
ACM
112views Data Mining» more  KDD 2005»
14 years 12 months ago
Data mining in the chemical industry
Alex N. Kalos, Tim Rey
KDD
2005
ACM
218views Data Mining» more  KDD 2005»
14 years 12 months ago
A maximum entropy web recommendation system: combining collaborative and content features
Web users display their preferences implicitly by navigating through a sequence of pages or by providing numeric ratings to some items. Web usage mining techniques are used to ext...
Xin Jin, Yanzan Zhou, Bamshad Mobasher
KDD
2005
ACM
162views Data Mining» more  KDD 2005»
14 years 12 months ago
Discovering frequent topological structures from graph datasets
The problem of finding frequent patterns from graph-based datasets is an important one that finds applications in drug discovery, protein structure analysis, XML querying, and soc...
Ruoming Jin, Chao Wang, Dmitrii Polshakov, Sriniva...
KDD
2005
ACM
106views Data Mining» more  KDD 2005»
14 years 12 months ago
Simultaneous optimization of complex mining tasks with a knowledgeable cache
With an increasing use of data mining tools and techniques, we envision that a Knowledge Discovery and Data Mining System (KDDMS) will have to support and optimize for the followi...
Ruoming Jin, Kaushik Sinha, Gagan Agrawal
KDD
2005
ACM
103views Data Mining» more  KDD 2005»
14 years 12 months ago
Fast discovery of unexpected patterns in data, relative to a Bayesian network
We consider a model in which background knowledge on a given domain of interest is available in terms of a Bayesian network, in addition to a large database. The mining problem is...
Szymon Jaroszewicz, Tobias Scheffer
KDD
2005
ACM
168views Data Mining» more  KDD 2005»
14 years 12 months ago
Nomograms for visualizing support vector machines
We propose a simple yet potentially very effective way of visualizing trained support vector machines. Nomograms are an established model visualization technique that can graphica...
Aleks Jakulin, Martin Mozina, Janez Demsar, Ivan B...