Information Technology

Abstract In this paper, we describe a novel approach to intrinsic plagiarism detection. Each suspicious document is divided into a series of consecutive, potentially overlapping �...

Mike Kestemont, Kim Luyckx, Walter Daelemans

claim paper

Read More »

243

click to vote

CLEF
2011
Springer

187views Information Technology» more CLEF 2011»

Simulation of Within-Session Query Variations Using a Text Segmentation Approach

14 years 7 months ago

Download doras.dcu.ie

Abstract. We propose a generative model for automatic query reformulations from an initial query using the underlying subtopic structure of top ranked retrieved documents. We addre...

Debasis Ganguly, Johannes Leveling, Gareth J. F. J...

claim paper

Read More »

210

click to vote

CLEF
2011
Springer

234views Information Technology» more CLEF 2011»

External Plagiarism Detection using Information Retrieval and Sequence Alignment - Notebook for PAN at CLEF 2011

14 years 7 months ago

Download www.uni-weimar.de

Abstract This paper describes the University of Shefﬁeld entry for the 3rd International Competition on Plagiarism Detection which attempted the monolingual external plagiarism d...

Rao Muhammad Adeel Nawab, Mark Stevenson, Paul D. ...

claim paper

Read More »

203

click to vote

CLEF
2011
Springer

236views Information Technology» more CLEF 2011»

Overview of the 2nd International Competition on Wikipedia Vandalism Detection

14 years 7 months ago

Download www.uni-weimar.de

Abstract The paper overviews the vandalism detection task of the PAN’11 competition. A new corpus is introduced which comprises about 30 000 Wikipedia edits in the languages Engl...

Martin Potthast, Teresa Holfeld

claim paper

Read More »

251

click to vote

CIKM
2011
Springer

200views Information Technology» more CIKM 2011»

Improved answer ranking in social question-answering portals

14 years 7 months ago

Download www.stefanriezler.com

Community QA portals provide an important resource for non-factoid question-answering. The inherent noisiness of user-generated data makes the identiﬁcation of high-quality cont...

Felix Hieber, Stefan Riezler

claim paper

Read More »

236

click to vote

CIKM
2011
Springer

230views Information Technology» more CIKM 2011»

Detecting anomalies in graphs with numeric labels

14 years 7 months ago

Download www.cs.qub.ac.uk

This paper presents Yagada, an algorithm to search labelled graphs for anomalies using both structural data and numeric attributes. Yagada is explained using several security-rela...

Michael Davis, Weiru Liu, Paul Miller, George Redp...

claim paper

Read More »

240

click to vote

CIKM
2011
Springer

185views Information Technology» more CIKM 2011»

Estimating selectivity for joined RDF triple patterns

14 years 7 months ago

Download www.it.swin.edu.au

A fundamental problem related to RDF query processing is selectivity estimation, which is crucial to query optimization for determining a join order of RDF triple patterns. In thi...

Hai Huang 0003, Chengfei Liu

claim paper

Read More »

244

Voted

CIKM
2011
Springer

253views Information Technology» more CIKM 2011»

Simultaneous joint and conditional modeling of documents tagged from two perspectives

14 years 7 months ago

Download www.acsu.buffalo.edu

This paper explores correspondence and mixture topic modeling of documents tagged from two diﬀerent perspectives. There has been ongoing work in topic modeling of documents with...

Pradipto Das, Rohini K. Srihari, Yun Fu

claim paper

Read More »

225

click to vote

CIKM
2011
Springer

188views Information Technology» more CIKM 2011»

Lower-bounding term frequency normalization

14 years 7 months ago

Download sifaka.cs.uiuc.edu

In this paper, we reveal a common deﬁciency of the current retrieval models: the component of term frequency (TF) normalization by document length is not lower-bounded properly;...

Yuanhua Lv, ChengXiang Zhai

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers