Comparison of the predictive outcomes for anti-tuberculosis drug-induced hepatotoxicity by different machine learning techniques

Nai Hua Lai, Wan Chen Shen, Chun Nin Lee, Jui Chia Chang, Man Ching Hsu, Li Na Kuo, Ming Chih Yu, Hsiang Yin Chen

Research output: Contribution to journalArticle


Background: The study compared the predictive outcomes of artificial neural network, support vector machine and random forest on the occurrence of anti-tuberculosis drug-induced hepatotoxicity. Methods: The clinical and genomic data of patients treated with anti-tuberculosis drugs at Taipei Medical University-Wanfang Hospital were used as training sets, and those at Taipei Medical University-Shuang Ho Hospital served as test sets. Features were selected through a univariate risk factor analysis and literature evaluation. The accuracy, sensitivity, specificity, and the area under the receiver operating characteristic curve were calculated to compare the traditional, genomic, and combined models of the three techniques. Results: Nine models were created with 7 clinical factors and 4 genotypes. Artificial neural network with clinical and genomic factors exhibited the best performance, with an accuracy of 88.67%, a sensitivity of 80%, and a specificity of 90.4% for the test set. The area under the receiver operating characteristic curve of this best model reached 0.894 for training set and 0.898 for test set, which was significantly better than 0.801 for training set and 0.728 for test set by support vector machine and 0.724 for training set and 0.718 for test set by random forest. Conclusions: Artificial neural network with clinical and genomic data can become a clinical useful tool in predicting anti-tuberculosis drug-induced hepatotoxicity. The machine learning technique can be an innovation to predict and prevent adverse drug reaction.

Original languageEnglish
Article number105307
JournalComputer Methods and Programs in Biomedicine
Publication statusPublished - May 2020



  • Anti-tuberculosis drugs
  • Artificial neural network
  • Feature selection
  • Gene polymorphism
  • Random forest
  • Support vector machine
  • Tuberculosis

ASJC Scopus subject areas

  • Software
  • Computer Science Applications
  • Health Informatics

Cite this