Search Sciweavers | Sciweavers

329 search results - page 10 / 66

» A Novel Method for Detecting Similar Documents

179

click to vote

NIPS
2004

109views Information Technology» more NIPS 2004»

A Probabilistic Model for Online Document Clustering with Application to Novelty Detection

15 years 7 months ago

Download www.gatsby.ucl.ac.uk

In this paper we propose a probabilistic model for online document clustering. We use non-parametric Dirichlet process prior to model the growing number of clusters, and use a pri...

Jian Zhang 0003, Zoubin Ghahramani, Yiming Yang

claim paper

Read More »

171

click to vote

ACL
2009

133views Computational Linguistics» more ACL 2009»

Summarizing multiple spoken documents: finding evidence from untranscribed audio

15 years 3 months ago

Download www.aclweb.org

This paper presents a model for summarizing multiple untranscribed spoken documents. Without assuming the availability of transcripts, the model modifies a recently proposed unsup...

Xiaodan Zhu, Gerald Penn, Frank Rudzicz

claim paper

Read More »

169

click to vote

WISE
2005
Springer

106views Internet Technology» more WISE 2005»

Document Re-ranking by Generality in Bio-medical Information Retrieval

15 years 11 months ago

Download kmi.open.ac.uk

Document ranking is well known to be a crucial process in information retrieval (IR). It presents retrieved documents in an order of their estimated degrees of relevance to query. ...

Xin Yan, Xue Li, Dawei Song

claim paper

Read More »

166

click to vote

FLAIRS
2006

134views Artificial Intelligence» more FLAIRS 2006»

Corpus Based Unsupervised Labeling of Documents

15 years 7 months ago

Download www.aaai.org

Text categorization involves mapping of documents to a fixed set of labels. A similar but equally important problem is that of assigning labels to large corpora. With a deluge of ...

Delip Rao, Deepak P, Deepak Khemani

claim paper

Read More »

172

click to vote

HT
2010
ACM

219views Internet Technology» more HT 2010»

Citation based plagiarism detection: a new approach to identify plagiarized work language independently

15 years 3 months ago

Download www.sciplore.org

This paper describes a new approach towards detecting plagiarism and scientific documents that have been read but not cited. In contrast to existing approaches, which analyze docu...

Bela Gipp, Jöran Beel

claim paper

Read More »

« Prev « First page 10 / 66 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers