Abstract. As the number of web pages increases dramatically, the problem of the information overload becomes more severe when browsing and searching the WWW. To alleviate this prob...
We present an algorithm, witch, that learns to detect spam hosts or pages on the Web. Unlike most other approaches, it simultaneously exploits the structure of the Web graph as we...
Jacob Abernethy, Olivier Chapelle, Carlos Castillo
Web spam is a widely-recognized threat to the quality and security of the Web. Web spam pages pollute search engine indexes, burden Web crawlers and Web mining services, and expos...
We are creating an interactive tool to help non-professional web site builders create high quality designs. We have previously reported that quantitative measures of web page stru...
Institutions and companies that are based in countries where the main language is not English typically publish Web sites that offer the same information at least in the local lan...
Filippo Ricca, Paolo Tonella, Emanuele Pianta, Chr...