Sciweavers

CIKM
2008
Springer
13 years 10 months ago
Generalized inverse document frequency
Inverse document frequency (IDF) is one of the most useful and widely used concepts in information retrieval. There have been various attempts to provide theoretical justification...
Donald Metzler
CIKM
2008
Springer
13 years 10 months ago
Data degradation: making private data less sensitive over time
Trail disclosure is the leakage of privacy sensitive data, resulting from negligence, attack or abusive scrutinization or usage of personal digital trails. To prevent trail disclo...
Nicolas Anciaux, Luc Bouganim, Harold van Heerde, ...
CIKM
2008
Springer
13 years 10 months ago
Modeling document features for expert finding
We argue that expert finding is sensitive to multiple document features in an organization, and therefore, can benefit from the incorporation of these document features. We propos...
Jianhan Zhu, Dawei Song, Stefan M. Rüger, Xia...
CIKM
2008
Springer
13 years 10 months ago
Using the current browsing context to improve search relevance
In this paper, we investigate the problem of improving the relevance of a Web search engine by adapting it to the dynamic needs of the user. We examine a representative case of su...
Mandar Rahurkar, Silviu Cucerzan
CIKM
2008
Springer
13 years 10 months ago
Categorizing blogger's interests based on short snippets of blog posts
Blogs have become an important medium for people to express opinions and share information on the web. Predicting the interests of bloggers can be beneficial for information retri...
Jiahui Liu, Larry Birnbaum, Bryan Pardo
CIKM
2008
Springer
13 years 10 months ago
Scalable complex pattern search in sequential data
Searching data streams has been traditionally very limited, either in the complexity of the search or in the size of the searched dataset. In this paper, we investigate the design...
Leila Kaghazian, Dennis McLeod, Reza Sadri
CIKM
2008
Springer
13 years 10 months ago
Exploiting context to detect sensitive information in call center conversations
Protecting sensitive information while preserving the shareability and usability of data is becoming increasingly important. In call-centers a lot of customer related sensitive in...
Tanveer A. Faruquie, Sumit Negi, Anup Chalamalla, ...
CIKM
2008
Springer
13 years 10 months ago
A survey of pre-retrieval query performance predictors
The focus of research on query performance prediction is to predict the effectiveness of a query given a search system and a collection of documents. If the performance of queries...
Claudia Hauff, Djoerd Hiemstra, Franciska de Jong
CIKM
2008
Springer
13 years 10 months ago
Scaling up duplicate detection in graph data
Duplicate detection determines different representations of realworld objects in a database. Recent research has considered the use of relationships among object representations t...
Melanie Herschel, Felix Naumann