Sciweavers

732 search results - page 93 / 147
» Mining Postal Addresses
Sort
View
SDM
2008
SIAM
157views Data Mining» more  SDM 2008»
13 years 9 months ago
ROC-tree: A Novel Decision Tree Induction Algorithm Based on Receiver Operating Characteristics to Classify Gene Expression Data
Gene expression information from microarray experiments is a primary form of data for biological analysis and can offer insights into disease processes and cellular behaviour. Suc...
M. Maruf Hossain, Md. Rafiul Hassan, James Bailey
SDM
2008
SIAM
114views Data Mining» more  SDM 2008»
13 years 9 months ago
Semi-Supervised Classification with Universum
The Universum data, defined as a collection of "nonexamples" that do not belong to any class of interest, have been shown to encode some prior knowledge by representing ...
Dan Zhang, Jingdong Wang, Fei Wang, Changshui Zhan...
SDM
2008
SIAM
133views Data Mining» more  SDM 2008»
13 years 9 months ago
Semantic Smoothing for Bayesian Text Classification with Small Training Data
Bayesian text classifiers face a common issue which is referred to as data sparsity problem, especially when the size of training data is very small. The frequently used Laplacian...
Xiaohua Zhou, Xiaodan Zhang, Xiaohua Hu
SDM
2007
SIAM
143views Data Mining» more  SDM 2007»
13 years 9 months ago
Patterns of Cascading Behavior in Large Blog Graphs
How do blogs cite and influence each other? How do such links evolve? Does the popularity of old blog posts drop exponentially with time? These are some of the questions that we ...
Jure Leskovec, Mary McGlohon, Christos Faloutsos, ...
SDM
2007
SIAM
117views Data Mining» more  SDM 2007»
13 years 9 months ago
Summarizing Review Scores of "Unequal" Reviewers
A frequently encountered problem in decision making is the following review problem: review a large number of objects and select a small number of the best ones. An example is sel...
Hady Wirawan Lauw, Ee-Peng Lim, Ke Wang