Sciweavers

17688 search results - page 119 / 3538
» Data Set Balancing
DATAMINE
2006
15 years 4 months ago
Computing LTS Regression for Large Data Sets
Least trimmed squares (LTS) regression is based on the subset of h cases (out of n) whose least squares fit possesses the smallest sum of squared residuals. The coverage h may be se...
Peter Rousseeuw, Katrien van Driessen
HICSS
2008
IEEE
15 years 11 months ago
Guidelines for Setting Organizational Policies for Data Quality
From a process perspective, the tasks that individuals carry out within an organization are linked. These linkages are often documented as process flow diagrams that connect the ...
Rajiv M. Dewan, Veda C. Storey
ISMIS
2011
Springer
14 years 7 months ago
Data Access Paths in Processing of Sets of Frequent Itemset Queries
Abstract. Frequent itemset mining can be regarded as advanced database querying where a user specifies the dataset to be mined and constraints to be satisfied by the discovered i...
Piotr Jedrzejczak, Marek Wojciechowski
ESANN
2007
15 years 5 months ago
Learning topology of a labeled data set with the supervised generative Gaussian graph
Abstract. Discovering the topology of a set of labeled data in a Euclidean space can help to design better decision systems. In this work, we propose a supervised generative model ...
Pierre Gaillard, Michaël Aupetit, Géra...
BMCBI
2011
14 years 8 months ago
Gene set analysis for longitudinal gene expression data
Background: Gene set analysis (GSA) has become a successful tool to interpret gene expression profiles in terms of biological functions, molecular pathways, or genomic locations. ...
Ke Zhang, Haiyan Wang, Arne C. Bathke, Solomon W. ...