دورية أكاديمية

Semi-supervised Learning Predicts Approximately One Third of the Alternative Splicing Isoforms as Functional Proteins

التفاصيل البيبلوغرافية
العنوان: Semi-supervised Learning Predicts Approximately One Third of the Alternative Splicing Isoforms as Functional Proteins
المؤلفون: Yanqi Hao, Recep Colak, Joan Teyra, Carles Corbi-Verge, Alexander Ignatchenko, Hannes Hahne, Mathias Wilhelm, Bernhard Kuster, Pascal Braun, Daisuke Kaida, Thomas Kislinger, Philip M. Kim
المصدر: Cell Reports, Vol 12, Iss 2, Pp 183-189 (2015)
بيانات النشر: Elsevier, 2015.
سنة النشر: 2015
المجموعة: LCC:Biology (General)
مصطلحات موضوعية: Biology (General), QH301-705.5
الوصف: Alternative splicing acts on transcripts from almost all human multi-exon genes. Notwithstanding its ubiquity, fundamental ramifications of splicing on protein expression remain unresolved. The number and identity of spliced transcripts that form stably folded proteins remain the sources of considerable debate, due largely to low coverage of experimental methods and the resulting absence of negative data. We circumvent this issue by developing a semi-supervised learning algorithm, positive unlabeled learning for splicing elucidation (PULSE; http://www.kimlab.org/software/pulse), which uses 48 features spanning various categories. We validated its accuracy on sets of bona fide protein isoforms and directly on mass spectrometry (MS) spectra for an overall AU-ROC of 0.85. We predict that around 32% of “exon skipping” alternative splicing events produce stable proteins, suggesting that the process engenders a significant number of previously uncharacterized proteins. We also provide insights into the distribution of positive isoforms in various functional classes and into the structural effects of alternative splicing.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 2211-1247
Relation: http://www.sciencedirect.com/science/article/pii/S2211124715006439; https://doaj.org/toc/2211-1247
DOI: 10.1016/j.celrep.2015.06.031
URL الوصول: https://doaj.org/article/84cadc28cef541d2a67e7163e22ad689
رقم الأكسشن: edsdoj.84cadc28cef541d2a67e7163e22ad689
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:22111247
DOI:10.1016/j.celrep.2015.06.031