Despite extensive studies in allergen prediction, current approaches still have room for performance improvement and suffer from the problem of lack of interpretable biological features. Thus, developments of allergen prediction method from sequences have become highly important to facilitate in silico vaccine design. In this study, we propose a systematic approach to predict allergenic proteins by incorporating sequence and physicochemical properties in machine learning algorithms. In addition, predictive performance of previous studies could be overestimated due to high redundancy in the data sets. Therefore, we reduce sequence redundancy in the data set and experiment results show that we achieve better predictive performance when compared with other approaches. This study can help discover new prophylactic and therapeutic vaccines for diseases. Moreover, we analyze immunological features that can provide valuable insights into immunotherapies of allergy and autoimmune diseases in translational bioinformatics.
|名字||Proceedings - International Conference on Machine Learning and Cybernetics|
|會議||18th International Conference on Machine Learning and Cybernetics, ICMLC 2019|
|期間||7/7/19 → 7/10/19|