Refinement of breast cancer risk prediction with concordant leading edge subsets from prognostic gene signatures

Chi Cheng Huang, Shih Hsin Tu, Heng Hui Lien, Ching Shui Huang, Chi Jung Huang, Liang Chuan Lai, Mon Hsun Tsai, Eric Y. Chuang

Research output: Contribution to journalArticle

1 Citation (Scopus)

Abstract

Several prognostic signatures have been identified for breast cancer. However, these signatures vary extensively in their gene compositions, and the poor concordance of the risk groups defined by the prognostic signatures hinders their clinical applicability. Breast cancer risk prediction was refined with a novel approach to finding concordant genes from leading edge analysis of prognostic signatures. Each signature was split into two gene sets, which contained either up-regulated or down-regulated genes, and leading edge analysis was performed within each array study for all up-/down-regulated gene sets of the same signature from all training datasets. Consensus of leading edge subsets among all training microarrays was used to synthesize a predictive model, which was then tested in independent studies by partial least squares regression. Only a small portion of six prognostic signatures (Amsterdam, Rotterdam, Genomic Grade Index, Recurrence Score, and Hu306 and PAM50 of intrinsic subtypes) was significantly enriched in the leading edge analysis in five training datasets (n = 2,380), and that the concordant leading edge subsets (43 genes) could identify the core signature genes that account for the enrichment signals providing prognostic power across all assayed samples. The proposed concordant leading edge algorithm was able to discriminate high-risk from low-risk patients in terms of relapse-free or distant metastasis-free survival in all training samples (hazard ratios: 1.84–2.20) and in three out of four independent studies (hazard ratios: 3.91–8.31). In some studies, the concordant leading edge subset remained a significant prognostic factor independent of clinical ER, HER2, and lymph node status. The present study provides a statistical framework for identifying core consensus across microarray studies with leading edge analysis, and a breast cancer risk predictive model was established.

Original languageEnglish
Pages (from-to)353-370
Number of pages18
JournalBreast Cancer Research and Treatment
Volume147
Issue number2
DOIs
Publication statusPublished - Sep 1 2014

Fingerprint

Breast Neoplasms
Genes
Recurrence
Least-Squares Analysis
Lymph Nodes
Neoplasm Metastasis
Survival
Datasets

Keywords

  • Breast cancer
  • Concordant genes
  • Gene expression profiles
  • Leading edge analysis
  • Microarray

ASJC Scopus subject areas

  • Oncology
  • Cancer Research

Cite this

Refinement of breast cancer risk prediction with concordant leading edge subsets from prognostic gene signatures. / Huang, Chi Cheng; Tu, Shih Hsin; Lien, Heng Hui; Huang, Ching Shui; Huang, Chi Jung; Lai, Liang Chuan; Tsai, Mon Hsun; Chuang, Eric Y.

In: Breast Cancer Research and Treatment, Vol. 147, No. 2, 01.09.2014, p. 353-370.

Research output: Contribution to journalArticle

Huang, Chi Cheng ; Tu, Shih Hsin ; Lien, Heng Hui ; Huang, Ching Shui ; Huang, Chi Jung ; Lai, Liang Chuan ; Tsai, Mon Hsun ; Chuang, Eric Y. / Refinement of breast cancer risk prediction with concordant leading edge subsets from prognostic gene signatures. In: Breast Cancer Research and Treatment. 2014 ; Vol. 147, No. 2. pp. 353-370.
@article{b4eab310dd9244f9bfc88f9831002a10,
title = "Refinement of breast cancer risk prediction with concordant leading edge subsets from prognostic gene signatures",
abstract = "Several prognostic signatures have been identified for breast cancer. However, these signatures vary extensively in their gene compositions, and the poor concordance of the risk groups defined by the prognostic signatures hinders their clinical applicability. Breast cancer risk prediction was refined with a novel approach to finding concordant genes from leading edge analysis of prognostic signatures. Each signature was split into two gene sets, which contained either up-regulated or down-regulated genes, and leading edge analysis was performed within each array study for all up-/down-regulated gene sets of the same signature from all training datasets. Consensus of leading edge subsets among all training microarrays was used to synthesize a predictive model, which was then tested in independent studies by partial least squares regression. Only a small portion of six prognostic signatures (Amsterdam, Rotterdam, Genomic Grade Index, Recurrence Score, and Hu306 and PAM50 of intrinsic subtypes) was significantly enriched in the leading edge analysis in five training datasets (n = 2,380), and that the concordant leading edge subsets (43 genes) could identify the core signature genes that account for the enrichment signals providing prognostic power across all assayed samples. The proposed concordant leading edge algorithm was able to discriminate high-risk from low-risk patients in terms of relapse-free or distant metastasis-free survival in all training samples (hazard ratios: 1.84–2.20) and in three out of four independent studies (hazard ratios: 3.91–8.31). In some studies, the concordant leading edge subset remained a significant prognostic factor independent of clinical ER, HER2, and lymph node status. The present study provides a statistical framework for identifying core consensus across microarray studies with leading edge analysis, and a breast cancer risk predictive model was established.",
keywords = "Breast cancer, Concordant genes, Gene expression profiles, Leading edge analysis, Microarray",
author = "Huang, {Chi Cheng} and Tu, {Shih Hsin} and Lien, {Heng Hui} and Huang, {Ching Shui} and Huang, {Chi Jung} and Lai, {Liang Chuan} and Tsai, {Mon Hsun} and Chuang, {Eric Y.}",
year = "2014",
month = "9",
day = "1",
doi = "10.1007/s10549-014-3104-6",
language = "English",
volume = "147",
pages = "353--370",
journal = "Breast Cancer Research and Treatment",
issn = "0167-6806",
publisher = "Springer New York",
number = "2",

}

TY - JOUR

T1 - Refinement of breast cancer risk prediction with concordant leading edge subsets from prognostic gene signatures

AU - Huang, Chi Cheng

AU - Tu, Shih Hsin

AU - Lien, Heng Hui

AU - Huang, Ching Shui

AU - Huang, Chi Jung

AU - Lai, Liang Chuan

AU - Tsai, Mon Hsun

AU - Chuang, Eric Y.

PY - 2014/9/1

Y1 - 2014/9/1

N2 - Several prognostic signatures have been identified for breast cancer. However, these signatures vary extensively in their gene compositions, and the poor concordance of the risk groups defined by the prognostic signatures hinders their clinical applicability. Breast cancer risk prediction was refined with a novel approach to finding concordant genes from leading edge analysis of prognostic signatures. Each signature was split into two gene sets, which contained either up-regulated or down-regulated genes, and leading edge analysis was performed within each array study for all up-/down-regulated gene sets of the same signature from all training datasets. Consensus of leading edge subsets among all training microarrays was used to synthesize a predictive model, which was then tested in independent studies by partial least squares regression. Only a small portion of six prognostic signatures (Amsterdam, Rotterdam, Genomic Grade Index, Recurrence Score, and Hu306 and PAM50 of intrinsic subtypes) was significantly enriched in the leading edge analysis in five training datasets (n = 2,380), and that the concordant leading edge subsets (43 genes) could identify the core signature genes that account for the enrichment signals providing prognostic power across all assayed samples. The proposed concordant leading edge algorithm was able to discriminate high-risk from low-risk patients in terms of relapse-free or distant metastasis-free survival in all training samples (hazard ratios: 1.84–2.20) and in three out of four independent studies (hazard ratios: 3.91–8.31). In some studies, the concordant leading edge subset remained a significant prognostic factor independent of clinical ER, HER2, and lymph node status. The present study provides a statistical framework for identifying core consensus across microarray studies with leading edge analysis, and a breast cancer risk predictive model was established.

AB - Several prognostic signatures have been identified for breast cancer. However, these signatures vary extensively in their gene compositions, and the poor concordance of the risk groups defined by the prognostic signatures hinders their clinical applicability. Breast cancer risk prediction was refined with a novel approach to finding concordant genes from leading edge analysis of prognostic signatures. Each signature was split into two gene sets, which contained either up-regulated or down-regulated genes, and leading edge analysis was performed within each array study for all up-/down-regulated gene sets of the same signature from all training datasets. Consensus of leading edge subsets among all training microarrays was used to synthesize a predictive model, which was then tested in independent studies by partial least squares regression. Only a small portion of six prognostic signatures (Amsterdam, Rotterdam, Genomic Grade Index, Recurrence Score, and Hu306 and PAM50 of intrinsic subtypes) was significantly enriched in the leading edge analysis in five training datasets (n = 2,380), and that the concordant leading edge subsets (43 genes) could identify the core signature genes that account for the enrichment signals providing prognostic power across all assayed samples. The proposed concordant leading edge algorithm was able to discriminate high-risk from low-risk patients in terms of relapse-free or distant metastasis-free survival in all training samples (hazard ratios: 1.84–2.20) and in three out of four independent studies (hazard ratios: 3.91–8.31). In some studies, the concordant leading edge subset remained a significant prognostic factor independent of clinical ER, HER2, and lymph node status. The present study provides a statistical framework for identifying core consensus across microarray studies with leading edge analysis, and a breast cancer risk predictive model was established.

KW - Breast cancer

KW - Concordant genes

KW - Gene expression profiles

KW - Leading edge analysis

KW - Microarray

UR - http://www.scopus.com/inward/record.url?scp=85027957781&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85027957781&partnerID=8YFLogxK

U2 - 10.1007/s10549-014-3104-6

DO - 10.1007/s10549-014-3104-6

M3 - Article

VL - 147

SP - 353

EP - 370

JO - Breast Cancer Research and Treatment

JF - Breast Cancer Research and Treatment

SN - 0167-6806

IS - 2

ER -