دورية أكاديمية

StartLink and StartLink+: Prediction of Gene Starts in Prokaryotic Genomes

التفاصيل البيبلوغرافية
العنوان: StartLink and StartLink+: Prediction of Gene Starts in Prokaryotic Genomes
المؤلفون: Karl Gemayel, Alexandre Lomsadze, Mark Borodovsky
المصدر: Frontiers in Bioinformatics, Vol 1 (2021)
بيانات النشر: Frontiers Media S.A., 2021.
سنة النشر: 2021
المجموعة: LCC:Computer applications to medicine. Medical informatics
مصطلحات موضوعية: gene prediction, inference of translation initiation start, multiple sequence alignment, Kimura distance, integration of omics features, Computer applications to medicine. Medical informatics, R858-859.7
الوصف: State-of-the-art algorithms of ab initio gene prediction for prokaryotic genomes were shown to be sufficiently accurate. A pair of algorithms would agree on predictions of gene 3′ends. Nonetheless, predictions of gene starts would not match for 15–25% of genes in a genome. This discrepancy is a serious issue that is difficult to be resolved due to the absence of sufficiently large sets of genes with experimentally verified starts. We have introduced StartLink that infers gene starts from conservation patterns revealed by multiple alignments of homologous nucleotide sequences. We also have introduced StartLink+ combining both ab initio and alignment-based methods. The ability of StartLink to predict the start of a given gene is restricted by the availability of homologs in a database. We observed that StartLink made predictions for 85% of genes per genome on average. The StartLink+ accuracy was shown to be 98–99% on the sets of genes with experimentally verified starts. In comparison with database annotations, we observed that the annotated gene starts deviated from the StartLink+ predictions for ∼5% of genes in AT-rich genomes and for 10–15% of genes in GC-rich genomes on average. The use of StartLink+ has a potential to significantly improve gene start annotation in genomic databases.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 2673-7647
Relation: https://www.frontiersin.org/articles/10.3389/fbinf.2021.704157/full; https://doaj.org/toc/2673-7647
DOI: 10.3389/fbinf.2021.704157
URL الوصول: https://doaj.org/article/b250eb98531d4354b83d90432ff8e450
رقم الأكسشن: edsdoj.b250eb98531d4354b83d90432ff8e450
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:26737647
DOI:10.3389/fbinf.2021.704157