The paper describes some innovations related to the ongoing work on the GSA prototype, an integrated information retrieval agent. In order to improve the original system effective...
Giovambattista Ianni, Francesco Ricca, Francesco C...
The next version of XHTML is at work-in-progress stage in the World Wide Web Consortium. It adds a lot of features to the most used content language of the Web. The most notable ch...
The paper stems from the idea that maybe the painstainkingly slow adoption of the Semantic Web into the mainstream www can be accelerated by taking clues from these tiny Semantic ...
Studying Web graphs is often difficult due to their large size. Recently, several proposals have been published about various techniques that allow to store a Web graph in memory ...
Automatically generated HTML, as produced by WYSIWYG programs, typically contains much repetitive and unnecessary markup. This paper identifies aspects of such HTML that may be al...