The GUC Goes to TREC 2004: Using Whole or Partial Documents for Retrieval and Classification in the Genomics Track

We were interested in examining the relative effect of using parts of the documents, different combinations of parts of the documents, or whole documents on retrieval and classification. We were also interested in the effect of MeSH terms on retrieval. Our experiments show that indexing titles, abstracts, and MeSH terms for adhoc retrieval yielded statistically significantly better results than any other part or combination of parts, with abstracts outperforming any other individual part of the documents. In the triage sub-task, using whole documents for training a classifier outperformed using titles, abstracts, diagram captions, MeSH terms, and windows of text around gene names. However, training a classifier using the combination of titles, abstracts, and MeSH terms produced results comparable to using whole documents.

Kareem Darwish, Amgad Madkour

Adhoc Retrieval | MeSH Terms | Relative Effect | TREC 2004 | TREC 2008

Post Info

Added	31 Oct 2010
Updated	31 Oct 2010
Type	Conference
Year	2004
Where	TREC
Authors	Kareem Darwish, Amgad Madkour
