It is crucial for a web crawler to distinguish between ephemeral and persistent content. Ephemeral content (e.g., quote of the day) is usually not worth crawling, because by the t...
Software evolution research is limited by the amount of information available to researchers: Current version control tools do not store all the information generated by developer...
In this paper, we introduce a new instance-based approach to the label ranking problem. This approach is based on a probability model on rankings which is known as the Mallows mode...
Querying by Visual Thesaurus (VT) is a novel paradigm for content-based image retrieval approaches for it gives the user the possibility, in case of inappropriate starting example...
In this paper, the task of text segmentation is approached from a topic modeling perspective. We investigate the use of latent Dirichlet allocation (LDA) topic model to segment a ...