Sciweavers

308 search results - page 5 / 62
» Syntactic Similarity of Web Documents
Sort
View
SPIRE
2004
Springer
14 years 12 days ago
Dealing with Syntactic Variation Through a Locality-Based Approach
To date, attempts for applying syntactic information in the document-based retrieval model dominant have led to little practical improvement, mainly due to the problems associated ...
Jesús Vilares Ferro, Miguel A. Alonso
CPM
2000
Springer
177views Combinatorics» more  CPM 2000»
13 years 11 months ago
Identifying and Filtering Near-Duplicate Documents
Abstract. The mathematical concept of document resemblance captures well the informal notion of syntactic similarity. The resemblance can be estimated using a fixed size “sketch...
Andrei Z. Broder
IPM
2008
141views more  IPM 2008»
13 years 7 months ago
Towards a unified approach to document similarity search using manifold-ranking of blocks
Document similarity search (i.e. query by example) aims to retrieve a ranked list of documents similar to a query document in a text corpus or on the Web. Most existing approaches...
Xiaojun Wan, Jianwu Yang, Jianguo Xiao
KES
2010
Springer
13 years 5 months ago
DOCODE-Lite: A Meta-Search Engine for Document Similarity Retrieval
The retrieval of similar documents from large scale datasets has been the one of the main concerns in knowledge management environments, such as plagiarism detection, news impact a...
Felipe Bravo-Marquez, Gaston L'Huillier, Sebasti&a...
IJCNLP
2005
Springer
14 years 17 days ago
A Comparative Study of Language Models for Book and Author Recognition
Abstract. Linguistic information can help improve evaluation of similarity between documents; however, the kind of linguistic information to be used depends on the task. In this pa...
Özlem Uzuner, Boris Katz