Sciweavers

684 search results - page 78 / 137
» Elimination of Redundant Information for Web Data Mining
Sort
View
KDD
2008
ACM
183views Data Mining» more  KDD 2008»
14 years 8 months ago
De-duping URLs via rewrite rules
A large fraction of the URLs on the web contain duplicate (or near-duplicate) content. De-duping URLs is an extremely important problem for search engines, since all the principal...
Anirban Dasgupta, Ravi Kumar, Amit Sasturkar
ACMSE
2004
ACM
14 years 1 months ago
Topic-based clustering of news articles
Recent years have witnessed an explosion in the availability of news articles on the World Wide Web. Although searchengines’ algorithms have made it easier to locate these docum...
Najaf Ali Shah, Ehab M. ElBahesh
NAR
2011
256views Computer Vision» more  NAR 2011»
12 years 10 months ago
COSMIC: mining complete cancer genomes in the Catalogue of Somatic Mutations in Cancer
COSMIC (http://www.sanger.ac.uk/cosmic) curates comprehensive information on somatic mutations in human cancer. Release v48 (July 2010) describes over 136 000 coding mutations in ...
Simon A. Forbes, Nidhi Bindal, Sally Bamford, Char...
PAAPP
2007
109views more  PAAPP 2007»
13 years 7 months ago
Relation rule mining
\Web users are nowadays confronted with the huge variety of available information sources whose content is not targeted at any specific group or layer. Recommendation systems aim...
Mehdi Adda, Rokia Missaoui, Petko Valtchev
WWW
2006
ACM
14 years 8 months ago
Image annotation using search and mining technologies
In this paper, we present a novel solution to the image annotation problem which annotates images using search and data mining technologies. An accurate keyword is required to ini...
Xin-Jing Wang, Lei Zhang, Feng Jing, Wei-Ying Ma