Over the last few years, blogs (web logs) have gained massive popularity and have become one of the most influential web social media in our times. Every blog post in the Blogosph...
This paper describes a word stemming algorithm for the Spanish Language. Experiments in document retrieval regarding English text suggest that word stemming based on morphological...
This paper focuses on analyzing (Japanese) splogs based on various characteristics of keywords contained in them. We estimate the behavior of spammers when creating splogs from ot...
Most prior work on information extraction has focused on extracting information from text in digital documents. However, often, the most important information being reported in an...
The representation of information collections needs to be optimized for human cognition. While documents often include rich visual components, collections, including personal coll...