
TREC
2004


Experience of Using SVM for the Triage Task in TREC 2004 Genomics Track



This paper reports our knowledge-ignorant machine learning approach to the triage task in the TREC 2004 Genomics Track, which is actually a text categorization problem. We applied Support Vector Machine (SVM) and found that information-gain based feature selection is helpful. Although we achieved decent performance in leave-one-out cross-validation experiments, the evaluation result on the test data turned out to be surprisingly poor. Further experiments revealed that there is a chasm between the training and test data distributions. It seems that more aggressive feature selection can partially alleviate the trouble caused by distribution change.

Keywords: Text Categorization, Machine Learning, Support Vector Machine, Feature Selection, Distribution Change.
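As a rough illustration of the information-gain based feature selection mentioned in the abstract, the sketch below scores each term by how much knowing its presence or absence reduces the entropy of the class label, then keeps the top k terms. It is not the authors' implementation: the function names (`entropy`, `information_gain`, `select_top_terms`), the binary term-presence representation, and the toy documents are all assumptions made for this example, and the paper's exact formulation and cutoff are not given here.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    counts = Counter(labels)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def information_gain(docs, labels, term):
    """Information gain of the binary feature 'term occurs in the document'.

    docs is a list of sets of terms; labels is the parallel list of classes.
    """
    with_term = [y for d, y in zip(docs, labels) if term in d]
    without_term = [y for d, y in zip(docs, labels) if term not in d]
    n = len(labels)
    # Expected label entropy after splitting on presence of the term.
    conditional = (len(with_term) / n) * entropy(with_term) \
        + (len(without_term) / n) * entropy(without_term)
    return entropy(labels) - conditional


def select_top_terms(docs, labels, k):
    """Return the k terms with the highest information gain.

    Ties are broken alphabetically so the result is deterministic.
    """
    vocabulary = set().union(*docs)
    ranked = sorted(
        vocabulary,
        key=lambda t: (-information_gain(docs, labels, t), t),
    )
    return ranked[:k]


# Toy data invented for illustration (1 = relevant, 0 = not relevant).
docs = [
    {"gene", "mouse", "expression"},
    {"gene", "mouse", "mutant"},
    {"protein", "structure", "gene"},
    {"protein", "binding", "structure"},
]
labels = [1, 1, 0, 0]
top_terms = select_top_terms(docs, labels, 2)
```

A smaller k corresponds to the "more aggressive" selection the abstract refers to; the retained terms would then define the feature vectors passed to a classifier such as an SVM.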

Dell Zhang, Wee Sun Lee




Related Content

» TREC 2004 Genomics Track Experiments at IUB

» UB at TREC 13 Genomics Track

» MeSH Based Feedback Concept Recognition and Stacked Classification for Curation Tasks

» DIMACS at the TREC 2004 Genomics Track

» BioText Team Experiments for the TREC 2004 Genomics Track

» The GUC Goes to TREC 2004 Using Whole or Partial Documents for Retrieval and Classificatio...

» York University at TREC 2004 HARD and Genomics Tracks

» Expanding Queries Using Stems and Symbols

» SJTU at TREC 2004 Web Track Experiments

Post Info

Added	31 Oct 2010
Updated	31 Oct 2010
Type	Conference
Year	2004
Where	TREC
Authors	Dell Zhang, Wee Sun Lee
