Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

91

TREC
2004

favoriteEmaildiscussreport

128views Information Technology» more TREC 2004»

Columbia University in the Novelty Track at TREC 2004

15 years 2 months ago

Columbia University in the Novelty Track at TREC 2004

Download trec.nist.gov

Our system for the Novelty Track at TREC 2004 looks beyond sentence boundaries as well as within sentences to identify novel, nonduplicative passages. It tries to identify text spans of two or more sentences that encompass mini-segments of new information. At the same time, we avoid any pairwise comparison of sentences, but rely on the presence of previously unseen terms to provide evidence of novelty. The system is guided by a number of parameters, both weights and thresholds, that are learned automatically with a randomized hill-climbing algorithm. During learning, we varied the target function to produce configurations that emphasize either precision or recall. We also implemented a straightforward vector-space model as a comparison and to test a combined approach.

Barry Schiffman, Kathleen McKeown

Real-time Traffic

Nonduplicative Passages | Pairwise Comparison | Sentence Boundaries | TREC 2004 | TREC 2008 |

claim paper

Related Content

» The University of Michigan in Novelty 2004

» Experiments in Terabyte Searching Genomic Retrieval and Novelty Detection for TREC 2004

» University of Lethbridges Participation in TREC 2004 QA Track

» Sheffield University and the TREC 2004 Genomics Track Query Expansion Using Synonymous Ter...

» Meiji University Web and Novelty Track Experiments at TREC 2003

» Overview of the TREC 2004 Novelty Track

» Novelty Question Answering and Genomics The University of Iowa Response

» ISI Novelty Track System for TREC 2004

» UMass at TREC 2004 Novelty and HARD

Post Info
More Details (n/a)

Added	31 Oct 2010
Updated	31 Oct 2010
Type	Conference
Year	2004
Where	TREC
Authors	Barry Schiffman, Kathleen McKeown

Comments (0)