دورية أكاديمية
Using deep learning to associate human genes with age-related diseases.
العنوان: | Using deep learning to associate human genes with age-related diseases. |
---|---|
المؤلفون: | Fabris F; School of Computing, University of Kent, Canterbury, Kent CT2 7NF, UK., Palmer D; Integrative Genomics of Ageing Group, Institute of Ageing and Chronic Disease, University of Liverpool, Liverpool L7 8TX, UK., Salama KM; School of Computing, University of Kent, Canterbury, Kent CT2 7NF, UK., de Magalhães JP; Integrative Genomics of Ageing Group, Institute of Ageing and Chronic Disease, University of Liverpool, Liverpool L7 8TX, UK., Freitas AA; School of Computing, University of Kent, Canterbury, Kent CT2 7NF, UK. |
المصدر: | Bioinformatics (Oxford, England) [Bioinformatics] 2020 Apr 01; Vol. 36 (7), pp. 2202-2208. |
نوع المنشور: | Journal Article; Research Support, Non-U.S. Gov't |
اللغة: | English |
بيانات الدورية: | Publisher: Oxford University Press Country of Publication: England NLM ID: 9808944 Publication Model: Print Cited Medium: Internet ISSN: 1367-4811 (Electronic) Linking ISSN: 13674803 NLM ISO Abbreviation: Bioinformatics Subsets: MEDLINE |
أسماء مطبوعة: | Original Publication: Oxford : Oxford University Press, c1998- |
مواضيع طبية MeSH: | Deep Learning* , Machine Learning*, Aging ; Gene Ontology ; Humans ; Neural Networks, Computer |
مستخلص: | Motivation: One way to identify genes possibly associated with ageing is to build a classification model (from the machine learning field) capable of classifying genes as associated with multiple age-related diseases. To build this model, we use a pre-compiled list of human genes associated with age-related diseases and apply a novel Deep Neural Network (DNN) method to find associations between gene descriptors (e.g. Gene Ontology terms, protein-protein interaction data and biological pathway information) and age-related diseases. Results: The novelty of our new DNN method is its modular architecture, which has the capability of combining several sources of biological data to predict which ageing-related diseases a gene is associated with (if any). Our DNN method achieves better predictive performance than standard DNN approaches, a Gradient Boosted Tree classifier (a strong baseline method) and a Logistic Regression classifier. Given the DNN model produced by our method, we use two approaches to identify human genes that are not known to be associated with age-related diseases according to our dataset. First, we investigate genes that are close to other disease-associated genes in a complex multi-dimensional feature space learned by the DNN algorithm. Second, using the class label probabilities output by our DNN approach, we identify genes with a high probability of being associated with age-related diseases according to the model. We provide evidence of these putative associations retrieved from the DNN model with literature support. Availability and Implementation: The source code and datasets can be found at: https://github.com/fabiofabris/Bioinfo2019. Supplementary Information: Supplementary data are available at Bioinformatics online. (© The Author(s) 2019. Published by Oxford University Press.) |
References: | Semin Hematol. 2008 Jul;45(3):135-40. (PMID: 18582619) Proc Natl Acad Sci U S A. 2000 May 9;97(10):5456-61. (PMID: 10779546) PLoS One. 2011 Feb 18;6(2):e17258. (PMID: 21364756) Nucleic Acids Res. 2016 Jun 20;44(11):e107. (PMID: 27084946) Am J Physiol Renal Physiol. 2016 Apr 15;310(8):F689-F696. (PMID: 26719364) J Renin Angiotensin Aldosterone Syst. 2015 Jun;16(2):227-33. (PMID: 23934336) Bioinformatics. 2018 Feb 15;34(4):660-668. (PMID: 29028931) J Alzheimers Dis. 2011;24(4):669-79. (PMID: 21297254) Biogerontology. 2017 Apr;18(2):171-188. (PMID: 28265788) Nucleic Acids Res. 2018 Jan 4;46(D1):D1083-D1090. (PMID: 29121237) Mol Syst Biol. 2016 Jul 29;12(7):878. (PMID: 27474269) Nature. 2014 Jan 9;505(7482):169-73. (PMID: 24317695) Trends Biotechnol. 2017 Nov;35(11):1062-1073. (PMID: 28778607) Health Aff (Millwood). 2013 Oct;32(10):1698-705. (PMID: 24101058) Hum Mol Genet. 2016 Nov 1;25(21):4804-4818. (PMID: 28175300) PLoS One. 2016 Apr 14;11(4):e0153480. (PMID: 27078871) Brief Funct Genomics. 2012 Nov;11(6):533-42. (PMID: 23063808) Ann Rheum Dis. 2016 Apr;76(4):654-660. (PMID: 28073800) Nat Genet. 2017 Oct;49(10):1450-1457. (PMID: 28869590) Eur J Epidemiol. 2016 Apr;31(4):337-50. (PMID: 27209009) Bioinformatics. 2018 Jul 15;34(14):2449-2456. (PMID: 29462247) Nat Genet. 2004 May;36(5):431-2. (PMID: 15118671) DNA Cell Biol. 2014 Jul;33(7):463-8. (PMID: 24839897) Oncotarget. 2018 Oct 26;9(84):35500-35511. (PMID: 30464806) Eur J Epidemiol. 2010 Apr;25(4):225-30. (PMID: 20339903) Gynecol Oncol. 2008 Jun;109(3):418-25. (PMID: 18395779) |
معلومات مُعتمدة: | BB/R014949/1 United Kingdom BB_ Biotechnology and Biological Sciences Research Council; 208375/Z/17/Z United Kingdom WT_ Wellcome Trust |
تواريخ الأحداث: | Date Created: 20191218 Date Completed: 20200916 Latest Revision: 20200916 |
رمز التحديث: | 20231215 |
مُعرف محوري في PubMed: | PMC7141856 |
DOI: | 10.1093/bioinformatics/btz887 |
PMID: | 31845988 |
قاعدة البيانات: | MEDLINE |
تدمد: | 1367-4811 |
---|---|
DOI: | 10.1093/bioinformatics/btz887 |