The widespread use of templates on the Web is considered harmful for two main reasons. Not only do they compromise the relevance judgment of many web IR and web mining methods suc...
Karane Vieira, Altigran Soares da Silva, Nick Pint...
Web spam, which refers to any deliberate actions bringing to selected web pages an unjustifiable favorable relevance or importance, is one of the major obstacles for high quality ...
Abstract. WebML (Web Modelling Language) is a language for the design of dataintensive Web sites. It is supported by visual tools allowing the definition of the conceptual data or...
Due to the growing importance of the World Wide Web, archiving it has become crucial for preserving useful source of information. To maintain a web archive up-to-date, crawlers ha...
Exploiting the complex structure of relational data enables to build better models by taking into account the additional information provided by the links between objects. We exten...