DeepEfflux: A 2D convolutional neural network model for identifying families of efflux proteins in transporters

Semmy Wellem Taju, Trinh Trung Duong Nguyen, Nguyen Quoc Khanh Le, Rosdyana Mangir Irawan Kusuma, Yu Yen Ou

研究成果: 雜誌貢獻文章同行評審

24 引文 斯高帕斯(Scopus)


Motivation: Efflux protein plays a key role in pumping xenobiotics out of the cells. The prediction of efflux family proteins involved in transport process of compounds is crucial for understanding family structures, functions and energy dependencies. Many methods have been proposed to classify efflux pump transporters without considerations of any pump specific of efflux protein families. In other words, efflux proteins protect cells from extrusion of foreign chemicals. Moreover, almost all efflux protein families have the same structure based on the analysis of significant motifs. The motif sequences consisting of the same amount of residues will have high degrees of residue similarity and thus will affect the classification process. Consequently, it is challenging but vital to recognize the structures and determine energy dependencies of efflux protein families. In order to efficiently identify efflux protein families with considering about pump specific, we developed a 2 D convolutional neural network (2 D CNN) model called DeepEfflux. DeepEfflux tried to capture the motifs of sequences around hidden target residues to use as hidden features of families. In addition, the 2 D CNN model uses a position-specific scoring matrix (PSSM) as an input. Three different datasets, each for one family of efflux protein, was fed into DeepEfflux, and then a 5-fold cross validation approach was used to evaluate the training performance. Results: The model evaluation results show that DeepEfflux outperforms traditional machine learning algorithms. Furthermore, the accuracy of 96.02%, 94.89% and 90.34% for classes A, B and C, respectively, in the independent test results show that our model can perform well and can be used as a reliable tool for identifying families of efflux proteins in transporters.
頁(從 - 到)3111-3117
出版狀態已發佈 - 1月 1 2018

ASJC Scopus subject areas

  • 統計與概率
  • 生物化學
  • 分子生物學
  • 電腦科學應用
  • 計算機理論與數學
  • 計算數學


深入研究「DeepEfflux: A 2D convolutional neural network model for identifying families of efflux proteins in transporters」主題。共同形成了獨特的指紋。