Sciweavers

361 search results - page 40 / 73
» On Mining Web Access Logs
Sort
View
ICDM
2010
IEEE
125views Data Mining» more  ICDM 2010»
13 years 7 months ago
Mining Arabic Business Reviews
For languages with rich content over the web, business reviews are easily accessible via many known websites, e.g., Yelp.com. For languages with poor content over the web like Arab...
Mohamed Elhawary, Mohamed G. Elfeky
IJWIS
2007
77views more  IJWIS 2007»
13 years 9 months ago
World's first web census
: Purpose — To measure the exact size of the World Wide Web (i.e., a census). The measure used is the number of publicly accessible web servers on port 80. Design/methodology/app...
Darcy G. Benoit, André Trudel
WWW
2006
ACM
14 years 3 months ago
Do not crawl in the DUST: different URLs with similar text
We consider the problem of dust: Different URLs with Similar Text. Such duplicate URLs are prevalent in web sites, as web server software often uses aliases and redirections, and...
Uri Schonfeld, Ziv Bar-Yossef, Idit Keidar
KDD
2003
ACM
161views Data Mining» more  KDD 2003»
14 years 10 months ago
Eliminating noisy information in Web pages for data mining
A commercial Web page typically contains many information blocks. Apart from the main content blocks, it usually has such blocks as navigation panels, copyright and privacy notice...
Lan Yi, Bing Liu, Xiaoli Li
WWW
2003
ACM
14 years 10 months ago
Functionality-Based Web Image Categorization
The World Wide Web provides an increasingly powerful and popular publication mechanism. Web documents often contain a large number of images serving various different purposes. Id...
Jianying Hu, Amit Bagga