Sciweavers

301 search results - page 12 / 61
» Metrics for Mining Multisets
WSDM
2010
ACM
261 views, Data Mining
14 years 5 months ago
Learning Similarity Metrics for Event Identification in Social Media
Social media sites (e.g., Flickr, YouTube, and Facebook) are a popular distribution outlet for users looking to share their experiences and interests on the Web. These sites host ...
Hila Becker, Mor Naaman, Luis Gravano
SISAP
2010
IEEE
135 views, Data Mining
13 years 6 months ago
Improving the similarity search of tandem mass spectra using metric access methods
In biological applications, tandem mass spectrometry is a widely used method for determining protein and peptide sequences from an "in vitro" sample. The sequences are not...
Jiri Novák, Tomás Skopal, David Hoks...
KDD
2009
ACM
180 views, Data Mining
14 years 8 months ago
Using graph-based metrics with empirical risk minimization to speed up active learning on networked data
Active and semi-supervised learning are important techniques when labeled data are scarce. Recently a method was suggested for combining active learning with a semi-supervised lea...
Sofus A. Macskassy
BMCBI
2010
153 views
13 years 8 months ago
Automatic symptom name normalization in clinical records of traditional Chinese medicine
Background: In recent years, Data Mining technology has been applied more than ever before in the field of traditional Chinese medicine (TCM) to discover regularities from the exp...
Yaqiang Wang, Zhonghua Yu, Yongguang Jiang, Kaikuo...
ICDE
2008
IEEE
157 views, Database
14 years 9 months ago
OptRR: Optimizing Randomized Response Schemes for Privacy-Preserving Data Mining
The randomized response (RR) technique is a promising technique to disguise private categorical data in Privacy-Preserving Data Mining (PPDM). Although a number of RR-based methods...
Zhengli Huang, Wenliang Du