Sciweavers

483 search results - page 71 / 97
» Sampling the Web as Training Data for Text Classification
Sort
View
156
Voted
BMCBI
2007
140views more  BMCBI 2007»
15 years 2 months ago
Prediction potential of candidate biomarker sets identified and validated on gene expression data from multiple datasets
Background: Independently derived expression profiles of the same biological condition often have few genes in common. In this study, we created populations of expression profiles...
Michael Gormley, William Dampier, Adam Ertel, Bilg...
153
Voted
IJCAI
2001
15 years 4 months ago
Active Learning for Class Probability Estimation and Ranking
For many supervised learning tasks it is very costly to produce training data with class labels. Active learning acquires data incrementally, at each stage using the model learned...
Maytal Saar-Tsechansky, Foster J. Provost
ICDM
2010
IEEE
226views Data Mining» more  ICDM 2010»
15 years 16 days ago
Edge Weight Regularization over Multiple Graphs for Similarity Learning
The growth of the web has directly influenced the increase in the availability of relational data. One of the key problems in mining such data is computing the similarity between o...
Pradeep Muthukrishnan, Dragomir R. Radev, Qiaozhu ...
121
Voted
KDD
2002
ACM
175views Data Mining» more  KDD 2002»
16 years 3 months ago
Mining product reputations on the Web
Knowing the reputations of your own and/or competitors' products is important for marketing and customer relationship management. It is, however, very costly to collect and a...
Satoshi Morinaga, Kenji Yamanishi, Kenji Tateishi,...
113
Voted
DMKD
2003
ACM
114views Data Mining» more  DMKD 2003»
15 years 7 months ago
Deriving link-context from HTML tag tree
HTML anchors are often surrounded by text that seems to describe the destination page appropriately. The text surrounding a link or the link-context is used for a variety of tasks...
Gautam Pant