Sciweavers

341 search results - page 3 / 69
» Data Cleaning and Semantic Improvement in Biological Databas...
Sort
View
CSWWS
2006
14 years 1 months ago
A Distributed Agent System upon Semantic Web Technologies to Provide Biological Data
Bioinformaticians are accustomed to going through analysis steps, in which they employ several data sources, like protein sequence and protein interaction databases, to carry out t...
Farzad Kohantorabi, Gregory Butler, Christopher J....
DASFAA
2007
IEEE
183views Database» more  DASFAA 2007»
14 years 4 months ago
BioDIFF: An Effective Fast Change Detection Algorithm for Biological Annotations
Abstract. Warehousing heterogeneous, dynamic biological data is a key technique for biological data integration as it greatly improves performance. However, it requires complex mai...
Yang Song, Sourav S. Bhowmick, C. Forbes Dewey
CAISE
2007
Springer
14 years 4 months ago
Declarative XML Data Cleaning with XClean
Data cleaning is the process of correcting anomalies in a data source, that may for instance be due to typographical errors, or duplicate representations of an entity. It is a cruc...
Melanie Weis, Ioana Manolescu
KDD
2008
ACM
176views Data Mining» more  KDD 2008»
14 years 10 months ago
Febrl -: an open source data cleaning, deduplication and record linkage system with a graphical user interface
Matching records that refer to the same entity across databases is becoming an increasingly important part of many data mining projects, as often data from multiple sources needs ...
Peter Christen
ICDE
2007
IEEE
104views Database» more  ICDE 2007»
14 years 11 months ago
Indexing Uncertain Categorical Data
Uncertainty in categorical data is commonplace in many applications, including data cleaning, database integration, and biological annotation. In such domains, the correct value o...
Sarvjeet Singh, Chris Mayfield, Sunil Prabhakar, R...