دورية أكاديمية
An effective deep learning-based approach for splice site identification in gene expression.
العنوان: | An effective deep learning-based approach for splice site identification in gene expression. |
---|---|
المؤلفون: | Ali M; Department of Computer Science, Bacha Khan University, Charsadda, KP, Pakistan., Shah D; Department of Computer Science, Bacha Khan University, Charsadda, KP, Pakistan., Qazi S; Department of Computer Science, Bacha Khan University, Charsadda, KP, Pakistan., Khan IA; Department of Computer Science, Bacha Khan University, Charsadda, KP, Pakistan., Abrar M; Faculty of Computer Science, Arab Open University, Muscat, Oman, Sultanate of Oman., Zahir S; Institute of Computer Sciences and Information Technology, The University of Agriculture Peshawar, Peshawar, KP, Pakistan. |
المصدر: | Science progress [Sci Prog] 2024 Jul-Sep; Vol. 107 (3), pp. 368504241266588. |
نوع المنشور: | Journal Article |
اللغة: | English |
بيانات الدورية: | Publisher: SAGE Publications Country of Publication: England NLM ID: 0411361 Publication Model: Print Cited Medium: Internet ISSN: 2047-7163 (Electronic) Linking ISSN: 00368504 NLM ISO Abbreviation: Sci Prog Subsets: MEDLINE |
أسماء مطبوعة: | Publication: <2019-> : [London] : SAGE Publications Original Publication: Oxford, Blackwell Scientific Publications [etc.] |
مواضيع طبية MeSH: | Deep Learning* , RNA Splice Sites*/genetics , Neural Networks, Computer*, Humans ; RNA Splicing ; Introns/genetics ; RNA, Messenger/genetics ; Algorithms ; Gene Expression ; Computational Biology/methods ; Exons/genetics |
مستخلص: | A crucial stage in eukaryote gene expression involves mRNA splicing by a protein assembly known as the spliceosome. This step significantly contributes to generating and properly operating the ultimate gene product. Since non-coding introns disrupt eukaryotic genes, splicing entails the elimination of introns and joining exons to create a functional mRNA molecule. Nevertheless, accurately finding splice sequence sites using various molecular biology techniques and other biological approaches is complex and time-consuming. This paper presents a precise and reliable computer-aided diagnosis (CAD) technique for the rapid and correct identification of splice site sequences. The proposed deep learning-based framework uses long short-term memory (LSTM) to extract distinct patterns from RNA sequences, enabling rapid and accurate point mutation sequence mapping. The proposed network employs one-hot encodings to find sequential patterns that effectively identify splicing sites. A thorough ablation study of traditional machine learning, one-dimensional convolutional neural networks (1D-CNNs), and recurrent neural networks (RNNs) models was conducted. The proposed LSTM network outperformed existing state-of-the-art approaches, improving accuracy by 3% and 2% for the acceptor and donor sites datasets. Competing Interests: Declaration of conflicting interestsThe authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article. |
References: | Genome Res. 2003 Dec;13(12):2637-50. (PMID: 14656968) Nucleic Acids Res. 2014 Dec 1;42(21):12961-72. (PMID: 25361964) Comput Methods Programs Biomed. 2016 May;128:1-11. (PMID: 27040827) Biomed Res Int. 2014;2014:623149. (PMID: 24967386) Nucleic Acids Res. 2001 Jul 15;29(14):2994-3005. (PMID: 11452024) Cell. 2019 Jan 24;176(3):535-548.e24. (PMID: 30661751) Biochem Biophys Res Commun. 2013 Feb 8;431(2):221-4. (PMID: 23313482) Diagnostics (Basel). 2022 Nov 09;12(11):. (PMID: 36359579) BMC Genomics. 2018 Jul 3;19(1):511. (PMID: 29970003) J Mol Biol. 1997 Apr 25;268(1):78-94. (PMID: 9149143) Energy Build. 2023 Sep 1;294:113204. (PMID: 37342253) Trends Biochem Sci. 2012 May;37(5):179-88. (PMID: 22480731) J Comput Biol. 1997 Summer;4(2):127-41. (PMID: 9228612) Sci Rep. 2017 Aug 15;7(1):8222. (PMID: 28811565) Nucleic Acids Res. 2001 Mar 1;29(5):1185-90. (PMID: 11222768) J Comput Biol. 1997 Fall;4(3):311-23. (PMID: 9278062) Proc Natl Acad Sci U S A. 1997 Jan 21;94(2):565-8. (PMID: 9012824) Bioinformatics. 2006 May 15;22(10):1207-10. (PMID: 16481334) Comput Biol Med. 1997 Jan;27(1):67-75. (PMID: 9055047) PLoS One. 2011;6(6):e20592. (PMID: 21698097) Nucleic Acids Res. 2013 Apr 1;41(6):e68. (PMID: 23303794) Int J Mol Sci. 2014 Jan 24;15(2):1746-66. (PMID: 24469313) Hum Mutat. 2019 Sep;40(9):1261-1269. (PMID: 31090248) |
فهرسة مساهمة: | Keywords: Artificial intelligence; RNA analysis; biomedical data; deep learning; genomics; splicing sites |
المشرفين على المادة: | 0 (RNA Splice Sites) 0 (RNA, Messenger) |
تواريخ الأحداث: | Date Created: 20240725 Date Completed: 20240725 Latest Revision: 20240728 |
رمز التحديث: | 20240728 |
مُعرف محوري في PubMed: | PMC11273556 |
DOI: | 10.1177/00368504241266588 |
PMID: | 39051530 |
قاعدة البيانات: | MEDLINE |
تدمد: | 2047-7163 |
---|---|
DOI: | 10.1177/00368504241266588 |