Using two-dimensional convolutional neural networks for identifying GTP binding sites in Rab proteins

Nguyen Quoc Khanh Le, Quang Thai Ho, Yu Yen Ou

Research output: Contribution to journalArticle

3 Citations (Scopus)

Abstract

Deep learning has been increasingly and widely used to solve numerous problems in various fields with state-of-the-art performance. It can also be applied in bioinformatics to reduce the requirement for feature extraction and reach high performance. This study attempts to use deep learning to predict GTP binding sites in Rab proteins, which is one of the most vital molecular functions in life science. A functional loss of GTP binding sites in Rab proteins has been implicated in a variety of human diseases (choroideremia, intellectual disability, cancer, Parkinson's disease). Therefore, creating a precise model to identify their functions is a crucial problem for understanding these diseases and designing the drug targets. Our deep learning model with two-dimensional convolutional neural network and position-specific scoring matrix profiles could identify GTP binding residues with achieved sensitivity of 92.3%, specificity of 99.8%, accuracy of 99.5%, and MCC of 0.92 for independent dataset. Compared with other published works, this approach achieved a significant improvement. Throughout the proposed study, we provide an effective model for predicting GTP binding sites in Rab proteins and a basis for further research that can apply deep learning in bioinformatics, especially in nucleotide binding site prediction.

Original languageEnglish
Article number1950005
JournalJournal of Bioinformatics and Computational Biology
Volume17
Issue number1
DOIs
Publication statusPublished - Feb 1 2019
Externally publishedYes

Fingerprint

Binding sites
Guanosine Triphosphate
Binding Sites
Learning
Neural networks
Proteins
Bioinformatics
Computational Biology
Choroideremia
Position-Specific Scoring Matrices
Biological Science Disciplines
Nucleotides
Intellectual Disability
Parkinson Disease
Feature extraction
Sensitivity and Specificity
Deep learning
Research
Pharmaceutical Preparations
Neoplasms

Keywords

  • deep learning
  • GTP binding site
  • nucleotide binding prediction
  • position-specific scoring matrix
  • Rab protein
  • vesicle membrane trafficking

ASJC Scopus subject areas

  • Biochemistry
  • Molecular Biology
  • Computer Science Applications

Cite this

Using two-dimensional convolutional neural networks for identifying GTP binding sites in Rab proteins. / Le, Nguyen Quoc Khanh; Ho, Quang Thai; Ou, Yu Yen.

In: Journal of Bioinformatics and Computational Biology, Vol. 17, No. 1, 1950005, 01.02.2019.

Research output: Contribution to journalArticle

@article{228a27497852435f851b3e9fca4d4138,
title = "Using two-dimensional convolutional neural networks for identifying GTP binding sites in Rab proteins",
abstract = "Deep learning has been increasingly and widely used to solve numerous problems in various fields with state-of-the-art performance. It can also be applied in bioinformatics to reduce the requirement for feature extraction and reach high performance. This study attempts to use deep learning to predict GTP binding sites in Rab proteins, which is one of the most vital molecular functions in life science. A functional loss of GTP binding sites in Rab proteins has been implicated in a variety of human diseases (choroideremia, intellectual disability, cancer, Parkinson's disease). Therefore, creating a precise model to identify their functions is a crucial problem for understanding these diseases and designing the drug targets. Our deep learning model with two-dimensional convolutional neural network and position-specific scoring matrix profiles could identify GTP binding residues with achieved sensitivity of 92.3{\%}, specificity of 99.8{\%}, accuracy of 99.5{\%}, and MCC of 0.92 for independent dataset. Compared with other published works, this approach achieved a significant improvement. Throughout the proposed study, we provide an effective model for predicting GTP binding sites in Rab proteins and a basis for further research that can apply deep learning in bioinformatics, especially in nucleotide binding site prediction.",
keywords = "deep learning, GTP binding site, nucleotide binding prediction, position-specific scoring matrix, Rab protein, vesicle membrane trafficking",
author = "Le, {Nguyen Quoc Khanh} and Ho, {Quang Thai} and Ou, {Yu Yen}",
year = "2019",
month = "2",
day = "1",
doi = "10.1142/S0219720019500057",
language = "English",
volume = "17",
journal = "Journal of Bioinformatics and Computational Biology",
issn = "0219-7200",
publisher = "World Scientific Publishing Co. Pte Ltd",
number = "1",

}

TY - JOUR

T1 - Using two-dimensional convolutional neural networks for identifying GTP binding sites in Rab proteins

AU - Le, Nguyen Quoc Khanh

AU - Ho, Quang Thai

AU - Ou, Yu Yen

PY - 2019/2/1

Y1 - 2019/2/1

N2 - Deep learning has been increasingly and widely used to solve numerous problems in various fields with state-of-the-art performance. It can also be applied in bioinformatics to reduce the requirement for feature extraction and reach high performance. This study attempts to use deep learning to predict GTP binding sites in Rab proteins, which is one of the most vital molecular functions in life science. A functional loss of GTP binding sites in Rab proteins has been implicated in a variety of human diseases (choroideremia, intellectual disability, cancer, Parkinson's disease). Therefore, creating a precise model to identify their functions is a crucial problem for understanding these diseases and designing the drug targets. Our deep learning model with two-dimensional convolutional neural network and position-specific scoring matrix profiles could identify GTP binding residues with achieved sensitivity of 92.3%, specificity of 99.8%, accuracy of 99.5%, and MCC of 0.92 for independent dataset. Compared with other published works, this approach achieved a significant improvement. Throughout the proposed study, we provide an effective model for predicting GTP binding sites in Rab proteins and a basis for further research that can apply deep learning in bioinformatics, especially in nucleotide binding site prediction.

AB - Deep learning has been increasingly and widely used to solve numerous problems in various fields with state-of-the-art performance. It can also be applied in bioinformatics to reduce the requirement for feature extraction and reach high performance. This study attempts to use deep learning to predict GTP binding sites in Rab proteins, which is one of the most vital molecular functions in life science. A functional loss of GTP binding sites in Rab proteins has been implicated in a variety of human diseases (choroideremia, intellectual disability, cancer, Parkinson's disease). Therefore, creating a precise model to identify their functions is a crucial problem for understanding these diseases and designing the drug targets. Our deep learning model with two-dimensional convolutional neural network and position-specific scoring matrix profiles could identify GTP binding residues with achieved sensitivity of 92.3%, specificity of 99.8%, accuracy of 99.5%, and MCC of 0.92 for independent dataset. Compared with other published works, this approach achieved a significant improvement. Throughout the proposed study, we provide an effective model for predicting GTP binding sites in Rab proteins and a basis for further research that can apply deep learning in bioinformatics, especially in nucleotide binding site prediction.

KW - deep learning

KW - GTP binding site

KW - nucleotide binding prediction

KW - position-specific scoring matrix

KW - Rab protein

KW - vesicle membrane trafficking

UR - http://www.scopus.com/inward/record.url?scp=85062847951&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85062847951&partnerID=8YFLogxK

U2 - 10.1142/S0219720019500057

DO - 10.1142/S0219720019500057

M3 - Article

C2 - 30866734

AN - SCOPUS:85062847951

VL - 17

JO - Journal of Bioinformatics and Computational Biology

JF - Journal of Bioinformatics and Computational Biology

SN - 0219-7200

IS - 1

M1 - 1950005

ER -