Sciweavers

» A Bayesian Metric for Evaluating Machine Learning Algorithms
KDD
2008
ACM
14 years 7 months ago
The cost of privacy: destruction of data-mining utility in anonymized data publishing
Re-identification is a major privacy threat to public datasets containing individual records. Many privacy protection algorithms rely on generalization and suppression of "qu...
Justin Brickell, Vitaly Shmatikov
CIKM
2009
Springer
14 years 1 month ago
Joint sentiment/topic model for sentiment analysis
Sentiment analysis or opinion mining aims to use automated tools to detect subjective information such as opinions, attitudes, and feelings expressed in text. This paper proposes ...
Chenghua Lin, Yulan He
CIKM
2009
Springer
14 years 1 month ago
Cross-language linking of news stories on the web using interlingual topic modelling
We have studied the problem of linking event information across different languages without the use of translation systems or dictionaries. The linking is based on interlingua in...
Wim De Smet, Marie-Francine Moens
SIGIR
2006
ACM
14 years 1 month ago
LDA-based document models for ad-hoc retrieval
Search algorithms incorporating some form of topic model have a long history in information retrieval. For example, cluster-based retrieval has been studied since the 60s and has ...
Xing Wei, W. Bruce Croft
IDEAL
2005
Springer
14 years 27 days ago
Probabilistic Data Generation for Deduplication and Data Linkage
In many data mining projects the data to be analysed contains personal information, like names and addresses. Cleaning and preprocessing of such data likely involves dedu...
Peter Christen