A Data Mining Approach for the Detection of High-Risk Breast Cancer Groups

15 years 5 months ago

Download kdbio.inesc-id.pt

It is widely agreed that complex diseases are typically caused by the joint effects of multiple instead of a single genetic variation. These genetic variations may show very little effect individually but strong effect if they occur jointly, a phenomenon known as epistasis or multilocus interaction. In this work, we explore the applicability of decision trees to this problem. A case-control study was performed, composed of 164 controls and 94 cases with 32 SNPs available from the BRCA1, BRCA2 and TP53 genes. There was also information about tobacco and alcohol consumption. We used a Decision Tree to find a group with high-susceptibility of suffering from breast cancer. Our goal was to find one or more leaves with a high percentage of cases and small percentage of controls. To statistically validate the association found, permutation tests were used. We found a high-risk breast cancer group composed of 13 cases and only 1 control, with a Fisher Exact Test value of 9.7

Orlando Anunciação, Bruno C. Gomes,

Real-time Traffic

Breast Cancer | Emerging Technology | Genetic Variations | ISAMI 2010 | Permutation Tests |

claim paper

Post Info
More Details (n/a)

Added	13 Feb 2011
Updated	13 Feb 2011
Type	Journal
Year	2010
Where	ISAMI
Authors	Orlando Anunciação, Bruno C. Gomes, Susana Vinga, Jorge Gaspar, Arlindo L. Oliveira, José Rueff

Comments (0)

Sciweavers

A Data Mining Approach for the Detection of High-Risk Breast Cancer Groups

Breast Cancer | Emerging Technology | Genetic Variations | ISAMI 2010 | Permutation Tests |

Explore & Download

Productivity Tools

Sciweavers