Pregnancy Associated Breast Cancer Gene Expressions : New Insights on Their Regulation Based on Rare Correlated Patterns

Pregnancy Associated Breast Cancer Gene Expressions : New Insights on Their Regulation Based on Rare Correlated Patterns

Abstract

Breast-cancer (BC) is the most common invasive cancer in women, with considerable death. Given that, BC is classified as a hormone-dependent cancer, when it collides with pregnancy, different questions may arise for which there are still no convincing answers. To deal with this issue, two new frameworks are proposed within this paper: CoRaM and Dist-CoRaM. The former is the first unified framework dedicated to the extraction of a generic basis of Correlated-Rare Association rules from gene expression data. The proposed approach has been successfully applied on a breast-cancer Gene Expression Matrix (GSE1379) with very promising results. The latter, the Dist-CoRaM approach, is a big-data processing based on Apache spark framework, dealing with correlation mining from micro-array pregnancy associated breast-cancer assays (PABC) data. It is successfully applied on the (GSE31192) gene expression matrix (GEM). The correlated patterns of gene-sets shed light on the fact that PABC exhibits heightened aggressiveness compared to cancers for Non-PABC women. Our findings suggest that higher levels of estrogen and progesterone hormones, unfortunately, are very keen to the increase of the tumor aggressiveness and the proliferation of the cancer.