Sciweavers

117 search results - page 6 / 24
» Data Provenance: A Categorization of Existing Approaches
Sort
View
SIGMOD
2008
ACM
167views Database» more  SIGMOD 2008»
14 years 8 months ago
Efficient lineage tracking for scientific workflows
Data lineage and data provenance are key to the management of scientific data. Not knowing the exact provenance and processing pipeline used to produce a derived data set often re...
Thomas Heinis, Gustavo Alonso
JMLR
2008
111views more  JMLR 2008»
13 years 8 months ago
Ranking Categorical Features Using Generalization Properties
Feature ranking is a fundamental machine learning task with various applications, including feature selection and decision tree learning. We describe and analyze a new feature ran...
Sivan Sabato, Shai Shalev-Shwartz
PAKDD
2010
ACM
158views Data Mining» more  PAKDD 2010»
14 years 1 months ago
Integrative Parameter-Free Clustering of Data with Mixed Type Attributes
Abstract. Integrative mining of heterogeneous data is one of the major challenges for data mining in the next decade. We address the problem of integrative clustering of data with ...
Christian Böhm, Sebastian Goebl, Annahita Osw...
HICSS
2003
IEEE
179views Biometrics» more  HICSS 2003»
14 years 1 months ago
Approaches of Wireless TCP Enhancement and A New Proposal Based on Congestion Coherence
TCP is known to have poor performance over unreliable wireless links where packet losses due to transmission errors are misinterpreted as indications of network congestion. TCP en...
Chunlei Liu, Raj Jain
SDM
2007
SIAM
204views Data Mining» more  SDM 2007»
13 years 10 months ago
Flexible Anonymization For Privacy Preserving Data Publishing: A Systematic Search Based Approach
k-anonymity is a popular measure of privacy for data publishing: It measures the risk of identity-disclosure of individuals whose personal information are released in the form of ...
Bijit Hore, Ravi Chandra Jammalamadaka, Sharad Meh...