Sciweavers

1260 search results - page 150 / 252
» Data Quality in Genome Databases
SIGMOD
2007
ACM
14 years 9 months ago
Genome-scale disk-based suffix tree indexing
With the exponential growth of biological sequence databases, it has become critical to develop effective techniques for storing, querying, and analyzing these massive data. Suffi...
Benjarath Phoophakdee, Mohammed J. Zaki
KDD
2005
ACM
14 years 2 months ago
Feature bagging for outlier detection
Outlier detection has recently become an important problem in many industrial and financial applications. In this paper, a novel feature bagging approach for detecting outliers in...
Aleksandar Lazarevic, Vipin Kumar
EKAW
2010
Springer
13 years 7 months ago
Ontology Engineering with Rough Concepts and Instances
A scenario in ontology development and its use is hypothesis testing, such as finding new subconcepts based on the data linked to the ontology. During such experimentation, knowle...
C. Maria Keet
BMCBI
2005
13 years 9 months ago
Visualization methods for statistical analysis of microarray clusters
Background: The most common method of identifying groups of functionally related genes in microarray data is to apply a clustering algorithm. However, it is impossible to determin...
Matthew A. Hibbs, Nathaniel C. Dirksen, Kai Li, Ol...
AUSDM
2007
Springer
14 years 1 months ago
A Two-Step Classification Approach to Unsupervised Record Linkage
Linking or matching databases is becoming increasingly important in many data mining projects, as linked data can contain information that is not available otherwise, or that woul...
Peter Christen