Search Sciweavers | Sciweavers

180 search results - page 5 / 36

» A Method for Calculating Term Similarity on Large Document C...

click to vote

DRR
2008

141views Document Analysis» more DRR 2008»

Hybrid approach combining contextual and statistical information for identifying MEDLINE citation terms

13 years 9 months ago

Download lhncbc.nlm.nih.gov

There is a strong demand for developing automated tools for extracting pertinent information from the biomedical literature that is a rich, complex, and dramatically growing resou...

In-Cheol Kim, Daniel X. Le, George R. Thoma

claim paper

Read More »

click to vote

ICCS
2009
Springer

107views Applied Computing» more ICCS 2009»

Frequent Itemset Mining for Clustering Near Duplicate Web Documents

14 years 2 months ago

Download www.mendeley.com

A vast amount of documents in the Web have duplicates, which is a challenge for developing eﬃcient methods that would compute clusters of similar documents. In this paper we use ...

Dmitry I. Ignatov, Sergei O. Kuznetsov

claim paper

Read More »

click to vote

CIKM
2007
Springer

87views Information Technology» more CIKM 2007»

Semiautomatic evaluation of retrieval systems using document similarities

14 years 1 months ago

Download ciir.cs.umass.edu

Taking advantage of the well-known cluster hypothesis that “closely associated documents tend to be relevant to the same request”, we can use inter-document similarity to prov...

Ben Carterette, James Allan

claim paper

Read More »

click to vote

SIGIR
2008
ACM

176views Information Technology» more SIGIR 2008»

SpotSigs: robust and efficient near duplicate detection in large web collections

13 years 7 months ago

Download ilpubs.stanford.edu

Motivated by our work with political scientists who need to manually analyze large Web archives of news sites, we present SpotSigs, a new algorithm for extracting and matching sig...

Martin Theobald, Jonathan Siddharth, Andreas Paepc...

claim paper

Read More »

click to vote

SAC
2011
ACM

157views Applied Computing» more SAC 2011»

Biomedical concept extraction based on combining the content-based and word order similarities

12 years 10 months ago

Download www.irit.fr

It is well known that the main objective of conceptual retrieval models is to go beyond simple term matching by relaxing term independence assumption through concept recognition. ...

Duy Dinh, Lynda Tamine

claim paper

Read More »

« Prev « First page 5 / 36 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers