A novel approach for discovering condition-specific correlations of gene expressions within biological pathways by using cloud computing technology

Tzu Hao Chang, Shih Lin Wu, Wei Jen Wang, Jorng Tzong Horng, Cheng Wei Chang

Research output: Contribution to journalArticle

2 Citations (Scopus)

Abstract

Microarrays are widely used to assess gene expressions. Most microarray studies focus primarily on identifying differential gene expressions between conditions (e.g., cancer versus normal cells), for discovering the major factors that cause diseases. Because previous studies have not identified the correlations of differential gene expression between conditions, crucial but abnormal regulations that cause diseases might have been disregarded. This paper proposes an approach for discovering the condition-specific correlations of gene expressions within biological pathways. Because analyzing gene expression correlations is time consuming, an Apache Hadoop cloud computing platform was implemented. Three microarray data sets of breast cancer were collected from the Gene Expression Omnibus, and pathway information from the Kyoto Encyclopedia of Genes and Genomes was applied for discovering meaningful biological correlations. The results showed that adopting the Hadoop platform considerably decreased the computation time. Several correlations of differential gene expressions were discovered between the relapse and nonrelapse breast cancer samples, and most of them were involved in cancer regulation and cancer-related pathways. The results showed that breast cancer recurrence might be highly associated with the abnormal regulations of these gene pairs, rather than with their individual expression levels. The proposed method was computationally efficient and reliable, and stable results were obtained when different data sets were used. The proposed method is effective in identifying meaningful biological regulation patterns between conditions.

Original languageEnglish
Article number763237
JournalBioMed Research International
Volume2014
DOIs
Publication statusPublished - 2014

Fingerprint

Cloud computing
Gene expression
Technology
Gene Expression
Microarrays
Genes
Breast Neoplasms
Encyclopedias
Recurrence
Neoplasms
Cloud Computing
Genome

ASJC Scopus subject areas

  • Biochemistry, Genetics and Molecular Biology(all)
  • Immunology and Microbiology(all)

Cite this

A novel approach for discovering condition-specific correlations of gene expressions within biological pathways by using cloud computing technology. / Chang, Tzu Hao; Wu, Shih Lin; Wang, Wei Jen; Horng, Jorng Tzong; Chang, Cheng Wei.

In: BioMed Research International, Vol. 2014, 763237, 2014.

Research output: Contribution to journalArticle

@article{3199a55476b64f969e2cb06c0df4b2b1,
title = "A novel approach for discovering condition-specific correlations of gene expressions within biological pathways by using cloud computing technology",
abstract = "Microarrays are widely used to assess gene expressions. Most microarray studies focus primarily on identifying differential gene expressions between conditions (e.g., cancer versus normal cells), for discovering the major factors that cause diseases. Because previous studies have not identified the correlations of differential gene expression between conditions, crucial but abnormal regulations that cause diseases might have been disregarded. This paper proposes an approach for discovering the condition-specific correlations of gene expressions within biological pathways. Because analyzing gene expression correlations is time consuming, an Apache Hadoop cloud computing platform was implemented. Three microarray data sets of breast cancer were collected from the Gene Expression Omnibus, and pathway information from the Kyoto Encyclopedia of Genes and Genomes was applied for discovering meaningful biological correlations. The results showed that adopting the Hadoop platform considerably decreased the computation time. Several correlations of differential gene expressions were discovered between the relapse and nonrelapse breast cancer samples, and most of them were involved in cancer regulation and cancer-related pathways. The results showed that breast cancer recurrence might be highly associated with the abnormal regulations of these gene pairs, rather than with their individual expression levels. The proposed method was computationally efficient and reliable, and stable results were obtained when different data sets were used. The proposed method is effective in identifying meaningful biological regulation patterns between conditions.",
author = "Chang, {Tzu Hao} and Wu, {Shih Lin} and Wang, {Wei Jen} and Horng, {Jorng Tzong} and Chang, {Cheng Wei}",
year = "2014",
doi = "10.1155/2014/763237",
language = "English",
volume = "2014",
journal = "BioMed Research International",
issn = "2314-6133",
publisher = "Hindawi Publishing Corporation",

}

TY - JOUR

T1 - A novel approach for discovering condition-specific correlations of gene expressions within biological pathways by using cloud computing technology

AU - Chang, Tzu Hao

AU - Wu, Shih Lin

AU - Wang, Wei Jen

AU - Horng, Jorng Tzong

AU - Chang, Cheng Wei

PY - 2014

Y1 - 2014

N2 - Microarrays are widely used to assess gene expressions. Most microarray studies focus primarily on identifying differential gene expressions between conditions (e.g., cancer versus normal cells), for discovering the major factors that cause diseases. Because previous studies have not identified the correlations of differential gene expression between conditions, crucial but abnormal regulations that cause diseases might have been disregarded. This paper proposes an approach for discovering the condition-specific correlations of gene expressions within biological pathways. Because analyzing gene expression correlations is time consuming, an Apache Hadoop cloud computing platform was implemented. Three microarray data sets of breast cancer were collected from the Gene Expression Omnibus, and pathway information from the Kyoto Encyclopedia of Genes and Genomes was applied for discovering meaningful biological correlations. The results showed that adopting the Hadoop platform considerably decreased the computation time. Several correlations of differential gene expressions were discovered between the relapse and nonrelapse breast cancer samples, and most of them were involved in cancer regulation and cancer-related pathways. The results showed that breast cancer recurrence might be highly associated with the abnormal regulations of these gene pairs, rather than with their individual expression levels. The proposed method was computationally efficient and reliable, and stable results were obtained when different data sets were used. The proposed method is effective in identifying meaningful biological regulation patterns between conditions.

AB - Microarrays are widely used to assess gene expressions. Most microarray studies focus primarily on identifying differential gene expressions between conditions (e.g., cancer versus normal cells), for discovering the major factors that cause diseases. Because previous studies have not identified the correlations of differential gene expression between conditions, crucial but abnormal regulations that cause diseases might have been disregarded. This paper proposes an approach for discovering the condition-specific correlations of gene expressions within biological pathways. Because analyzing gene expression correlations is time consuming, an Apache Hadoop cloud computing platform was implemented. Three microarray data sets of breast cancer were collected from the Gene Expression Omnibus, and pathway information from the Kyoto Encyclopedia of Genes and Genomes was applied for discovering meaningful biological correlations. The results showed that adopting the Hadoop platform considerably decreased the computation time. Several correlations of differential gene expressions were discovered between the relapse and nonrelapse breast cancer samples, and most of them were involved in cancer regulation and cancer-related pathways. The results showed that breast cancer recurrence might be highly associated with the abnormal regulations of these gene pairs, rather than with their individual expression levels. The proposed method was computationally efficient and reliable, and stable results were obtained when different data sets were used. The proposed method is effective in identifying meaningful biological regulation patterns between conditions.

UR - http://www.scopus.com/inward/record.url?scp=84893808457&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84893808457&partnerID=8YFLogxK

U2 - 10.1155/2014/763237

DO - 10.1155/2014/763237

M3 - Article

VL - 2014

JO - BioMed Research International

JF - BioMed Research International

SN - 2314-6133

M1 - 763237

ER -