Search Sciweavers | Sciweavers

CICLING
2010
Springer

174 views, Natural Language Processing

Word Length n-Grams for Text Re-use Detection

15 years 6 months ago

Abstract. The automatic detection of shared content in written documents –which includes text reuse and its unacknowledged commitment, plagiarism– has become an important probl...

Alberto Barrón-Cedeño, Chiara Basile...

CIKM
2011
Springer

188 views, Information Technology

Lower-bounding term frequency normalization

14 years 2 months ago

Download sifaka.cs.uiuc.edu

In this paper, we reveal a common deficiency of the current retrieval models: the component of term frequency (TF) normalization by document length is not lower-bounded properly;...

Yuanhua Lv, ChengXiang Zhai

CIKM
2004
Springer

170 views, Information Technology

InfoAnalyzer: a computer-aided tool for building enterprise taxonomies

15 years 6 months ago

Download domino.research.ibm.com

In this paper we study the problem of collecting training samples for building enterprise taxonomies. We develop a computer-aided tool named InfoAnalyzer, which can effectively as...

Li Zhang, Shixia Liu, Yue Pan, Liping Yang

ECIR
2003
Springer

94 views, Information Technology

Hierarchical Indexing and Flexible Element Retrieval for Structured Document

15 years 4 months ago

Download research.microsoft.com

As more and more structured documents, such as SGML or XML documents, become available on the Web, there is a growing demand to develop effective structured document retrieval which...

Hang Cui, Ji-Rong Wen, Tat-Seng Chua

TREC
1997

74 views, Information Technology

Short Queries, Natural Language and Spoken Document Retrieval: Experiments at Glasgow University

15 years 4 months ago

Download dis.shef.ac.uk

This paper contains a description of the methodology and results of the three TREC submissions made by the Glasgow IR group (glair). In addition to submitting to the ad hoc task, ...

Fabio Crestani, Mark Sanderson, Marcos Theophylact...
