Prediction of lysine propionylation sites using biased SVM and incorporating four different sequence features into Chou’s PseAAC

التفاصيل البيبلوغرافية
العنوان: Prediction of lysine propionylation sites using biased SVM and incorporating four different sequence features into Chou’s PseAAC
المؤلفون: Jian-Jun He, Zhe Ju
المصدر: Journal of Molecular Graphics and Modelling. 76:356-363
بيانات النشر: Elsevier BV, 2017.
سنة النشر: 2017
مصطلحات موضوعية: 0301 basic medicine, Support Vector Machine, Lysine, Feature extraction, Feature selection, Biology, complex mixtures, 03 medical and health sciences, Protein acylation, Materials Chemistry, Feature (machine learning), Position-Specific Scoring Matrices, Amino Acid Sequence, Amino Acids, Physical and Theoretical Chemistry, Spectroscopy, Sequence, business.industry, Computational Biology, Reproducibility of Results, Acetylation, Pattern recognition, Computer Graphics and Computer-Aided Design, Support vector machine, 030104 developmental biology, Amino acid composition, lipids (amino acids, peptides, and proteins), Artificial intelligence, Peptides, business, Protein Processing, Post-Translational, Algorithms
الوصف: Lysine propionylation is an important and common protein acylation modification in both prokaryotes and eukaryotes. To better understand the molecular mechanism of propionylation, it is important to identify propionylated substrates and their corresponding propionylation sites accurately. In this study, a novel bioinformatics tool named PropPred is developed to predict propionylation sites by using multiple feature extraction and biased support vector machine. On the one hand, various features are incorporated, including amino acid composition, amino acid factors, binary encoding, and the composition of k-spaced amino acid pairs. And the F-score feature method and the incremental feature selection algorithm are adopted to remove the redundant features. On the other hand, the biased support vector machine algorithm is used to handle the imbalanced problem in propionylation sites training dataset. As illustrated by 10-fold cross-validation, the performance of PropPred achieves a satisfactory performance with a Sensitivity of 70.03%, a Specificity of 75.61%, an accuracy of 75.02% and a Matthew's correlation coefficient of 0.3085. Feature analysis shows that some amino acid factors play the most important roles in the prediction of propionylation sites. These analysis and prediction results might provide some clues for understanding the molecular mechanisms of propionylation. A user-friendly web-server for PropPred is established at 123.206.31.171/PropPred/.
تدمد: 1093-3263
URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::31f96df3dc68078c006ef59dd3864fb0
https://doi.org/10.1016/j.jmgm.2017.07.022
حقوق: CLOSED
رقم الأكسشن: edsair.doi.dedup.....31f96df3dc68078c006ef59dd3864fb0
قاعدة البيانات: OpenAIRE