Background/purpose: Previously we had identified concurrent genes, which highlighted the interplay between copy number variation (CNV) and differential gene expression (GE) for Han Chinese breast cancers. The merit of the approach is to discovery biomarkers not identifiable by conventional GE only data, for which phenotype-correlation or gene variability is the criteria of gene selection. Materials and methods: Thirty-one comparative genomic hybridization (CGH) and 83 GE microarrays were performed, with 29 breast cancers assayed from both platforms. Potential targets were revealed by Genomic Identification of Significant Targets in Cancer (GISTIC) from CGH arrays. Concurrent genes and genes with significant GISTIC scores were used to derive the extended concurrent genes signature, which was consensus from leading edge analysis across all studies and a supervised partial least square (PLS) regression predictive model of disease-free survival was constructed. Results: There were 1584 concurrent genes from 29 samples with both CGH and GE microarrays. Enriched concurrent genes sets for disease-free survival were identified independently from 83 GE arrays and another one with Han Chinese origin as well as three studies of Western origin. For five studies with disease-free survival follow up, prognostic discrepancy was observed between predicted high-risk and low-risk group patients. Conclusion: We concluded that through parallel analyses of CGH and GE microarrays, the proposed extended concurrent gene expression signature can identify biomarkers with prognostic values.
- Breast cancer
- Extended concurrent genes signature
- Leading edge analysis
- Partial least square regression
ASJC Scopus subject areas