DTD and its instance have been considered the standard format for data representation and information exchange on the current web. However, when coming to the next generation of w...
Text classification categorizes Web documents in large collections into predefined classes based on their contents. Unfortunately, the classification process can be time-consumi...
Text classification, document clustering and similar document analysis tasks are currently the subject of significant global research, since such areas underpin web intelligence,...
The value of showing important, yet separated, parts of a document simultaneously motivates head-tail display. Of the Web documents tested, 35% benefit. A head-tail display provides a ...
Daniel Berleant, Jinghao Miao, M. Arvold, J. Brown...
In this paper we process and analyze web search engine query and click data from the perspective of the documents (URLs) selected. We initially define possible document categor...