- Abdalla, Moustafa;
- Tran-Thanh, Danh;
- Moreno, Juan;
- Iakovlev, Vladimir;
- Nair, Ranju;
- Kanwar, Nisha;
- Abdalla, Mohamed;
- Lee, Jennifer PY;
- Kwan, Jennifer Yin Yee;
- Cawthorn, Thomas R;
- Warren, Keisha;
- Arneson, Nona;
- Wang, Dong-Yu;
- Fox, Natalie S;
- Youngson, Bruce J;
- Miller, Naomi A;
- Easson, Alexandra M;
- McCready, David;
- Leong, Wey L;
- Boutros, Paul C;
- Done, Susan J
Almost all genomic studies of breast cancer have focused on well-established tumours because it is technically challenging to study the earliest mutational events occurring in human breast epithelial cells. To address this we created a unique dataset of epithelial samples ductoscopically obtained from ducts leading to breast carcinomas and matched samples from ducts on the opposite side of the nipple. Here, we demonstrate that perturbations in mRNA abundance, with increasing proximity to tumour, cannot be explained by copy number aberrations. Rather, we find a possibility of field cancerization surrounding the primary tumour by constructing a classifier that evaluates where epithelial samples were obtained relative to a tumour (cross-validated micro-averaged AUC = 0.74). We implement a spectral co-clustering algorithm to define biclusters. Relating to over-represented bicluster pathways, we further validate two genes with tissue microarrays and in vitro experiments. We highlight evidence suggesting that bicluster perturbation occurs early in tumour development.