Sciweavers

103 search results - page 8 / 21
» Models and Algorithms for Duplicate Document Detection
WWW
2006
ACM
14 years 8 months ago
Detecting nepotistic links by language model disagreement
In this short note we demonstrate the applicability of hyperlink downweighting by means of language model disagreement. The method filters out hyperlinks with no relevance to the ...
András A. Benczúr, István B&i...
MTA
2007
13 years 6 months ago
A study on multimedia documents behavior: a notion of equivalence
Abstract. In this paper we address the problem of comparing multimedia documents, which can be described according to different reference models. If we consider presentations as c...
Paola Bertolotti, Ombretta Gaggi
MSR
2006
ACM
14 years 1 months ago
Detecting similar Java classes using tree algorithms
Similarity analysis of source code is helpful during development to provide, for instance, better support for code reuse. Consider a development environment that analyzes code whi...
Tobias Sager, Abraham Bernstein, Martin Pinzger, C...
BIRD
2007
Springer
13 years 11 months ago
An Evaluation of Text Retrieval Methods for Similarity Search of Multi-dimensional NMR-Spectra
Abstract. Searching and mining nuclear magnetic resonance (NMR) spectra of naturally occurring substances is an important task to investigate new potentially useful chemical compoun...
Alexander Hinneburg, Andrea Porzel, Karina Wolfram
SIGIR
2005
ACM
14 years 27 days ago
Using term informativeness for named entity detection
Informal communication (e-mail, bulletin boards) poses a difficult learning environment because traditional grammatical and lexical information are noisy. Other information is nec...
Jason D. M. Rennie, Tommi Jaakkola