Sequence-based Prediction of Pathogen-host Interaction Using an Ensemble Learning Classifier and Moran Autocorrelation Feature Encoding Method
Main Article Content
Abstract
Pathogen–host protein interaction (PHI) is an interaction between two proteins from different organism. Knowledge about an
effect of a PHI help to study how a virus can infects an organism and also to develop a drug design for treat the corresponding
disease. There are a lot of computational methods that has been developed to predict whether or not an interaction between a
pair of protein so a researchers can learn PHI more efficient, especially in terms of cost and time. One of computational
method is to predict a possibility of protein interaction using only their amino acid sequences. This paper examined a method
of PHI prediction using moran autocorrelation as the encoding feature. In this paper, we develop an ensemble learning model
as classifier (ELC) using combination of SVM, RF and GBDT classifier. We also compare the result obtained from the
proposed method with the use the other machine learning methods such as gradient boosting, random forest, support vector
machine, and recurrent neural network. ELC was superior than the other in terms of accuracy, the MAC-ELC achieved average
accuracy up to 77.85, while the others are below 77%. The method we proposed also good in terms of give an average of
sensitivity 81.69%, specificity 73.90% and F1 score 78.92%.
Downloads
Metrics
Article Details
You are free to:
- Share — copy and redistribute the material in any medium or format for any purpose, even commercially.
- Adapt — remix, transform, and build upon the material for any purpose, even commercially.
- The licensor cannot revoke these freedoms as long as you follow the license terms.
Under the following terms:
- Attribution — You must give appropriate credit , provide a link to the license, and indicate if changes were made . You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.
Notices:
You do not have to comply with the license for elements of the material in the public domain or where your use is permitted by an applicable exception or limitation .
No warranties are given. The license may not give you all of the permissions necessary for your intended use. For example, other rights such as publicity, privacy, or moral rights may limit how you use the material.