Viability of in-house datamarting approaches for population genetics analysis of SNP genotypes

Download www.biomedcentral.com

Background: Databases containing very large amounts of SNP (Single Nucleotide Polymorphism) data are now freely available to researchers interested in medical and/or population genetics applications. While many of these SNP repositories have implemented data retrieval tools for general-purpose mining, these tools alone cannot cover the broad spectrum of needs of most medical and population genetics studies. Results: To address this limitation, we have built customized in-house data marts from the raw data provided by the largest public databases. In particular, for population genetics analysis based on genotypes, we have built a set of data processing scripts that take raw data from the major SNP variation databases (e.g. HapMap, Perlegen), split it into single genotypes, group these genotypes into populations, and then merge them with complementary descriptive information extracted from dbSNP. This allows not only in-house standardization and normalization of the genot...
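The processing steps named in the abstract (split raw data into single genotypes, group them by population, merge with descriptive SNP information) can be illustrated with a minimal Python sketch. It is not the authors' scripts: the record layout, the sample and SNP identifiers, the annotation fields, and the function name `build_mart` are all hypothetical and chosen only for illustration.

```python
from collections import Counter, defaultdict

# Hypothetical raw records, one genotype per (sample, SNP) pair.
raw_genotypes = [
    ("S1", "rs1", "AG"),
    ("S2", "rs1", "AA"),
    ("S3", "rs1", "GG"),
    ("S1", "rs2", "CT"),
    ("S3", "rs2", "NN"),  # missing call, ignored below
]

# Hypothetical sample-to-population assignment.
sample_population = {"S1": "POP_A", "S2": "POP_A", "S3": "POP_B"}

# Hypothetical descriptive information, standing in for a dbSNP extract.
snp_annotation = {
    "rs1": {"chrom": "1", "pos": 1000},
    "rs2": {"chrom": "2", "pos": 2000},
}


def build_mart(genotypes, populations, annotation):
    """Group single genotypes by (SNP, population) and attach annotation."""
    counts = defaultdict(Counter)
    for sample, snp, genotype in genotypes:
        pop = populations.get(sample)
        if pop is None:
            continue  # sample without a known population
        for allele in genotype:
            if allele in "ACGT":  # skip missing or ambiguous calls
                counts[(snp, pop)][allele] += 1

    rows = []
    for (snp, pop), alleles in sorted(counts.items()):
        total = sum(alleles.values())
        row = {
            "snp": snp,
            "population": pop,
            "n_alleles": total,
            "freqs": {a: c / total for a, c in sorted(alleles.items())},
        }
        row.update(annotation.get(snp, {}))  # merge descriptive fields
        rows.append(row)
    return rows


mart = build_mart(raw_genotypes, sample_population, snp_annotation)
for row in mart:
    print(row)
```

Each output row holds per-population allele counts and frequencies for one SNP together with its annotation, which is the kind of pre-aggregated table a data mart would store so that population-level queries need not rescan the raw genotypes.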

Jorge Amigo, Christopher Phillips, Antonio Salas

CIKM 2008 | Information Management | Population Genetics | Raw Data | SNP Repositories |

Added	12 Oct 2010
Updated	12 Oct 2010
Type	Conference
Year	2008
Where	CIKM
Authors	Jorge Amigo, Christopher Phillips, Antonio Salas
