The world wide web is the largest source for all kind of information currently available. Due to its enormous size retrieving relevant information is a difficult task for which us...
An important and well-studied problem is the production of semantic lexicons from a large corpus. In this paper, we present a system named ASIA (Automatic Set Instance Acquirer), ...
Determining the user intent of Web searches is a difficult problem due to the sparse data available concerning the searcher. In this paper, we examine a method to determine the us...
Bernard J. Jansen, Danielle L. Booth, Amanda Spink
Current technologies aimed at supporting processes – whether it is a business process or a learning process – are usually based on using a dedicated set of metadata to describ...
: Hypertext categorization is the automatic classification of web documents into predefined classes. It poses new challenges for automatic categorization because of the rich inform...