دورية أكاديمية

Mapping-friendly sequence reductions: Going beyond homopolymer compression

التفاصيل البيبلوغرافية
العنوان: Mapping-friendly sequence reductions: Going beyond homopolymer compression
المؤلفون: Luc Blassel, Paul Medvedev, Rayan Chikhi
المصدر: iScience, Vol 25, Iss 11, Pp 105305- (2022)
بيانات النشر: Elsevier, 2022.
سنة النشر: 2022
المجموعة: LCC:Science
مصطلحات موضوعية: Biological sciences, Molecular biology, Biological sciences research methodologies, Transcriptomics, Science
الوصف: Summary: Sequencing errors continue to pose algorithmic challenges to methods working with sequencing data. One of the simplest and most prevalent techniques for ameliorating the detrimental effects of homopolymer expansion/contraction errors present in long reads is homopolymer compression. It collapses runs of repeated nucleotides, to remove some sequencing errors and improve mapping sensitivity. Though our intuitive understanding justifies why homopolymer compression works, it in no way implies that it is the best transformation that can be done. In this paper, we explore if there are transformations that can be applied in the same pre-processing manner as homopolymer compression that would achieve better alignment sensitivity. We introduce a more general framework than homopolymer compression, called mapping-friendly sequence reductions. We transform the reference and the reads using these reductions and then apply an alignment algorithm. We demonstrate that some mapping-friendly sequence reductions lead to improved mapping accuracy, outperforming homopolymer compression.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 2589-0042
Relation: http://www.sciencedirect.com/science/article/pii/S2589004222015772; https://doaj.org/toc/2589-0042
DOI: 10.1016/j.isci.2022.105305
URL الوصول: https://doaj.org/article/9f30d939a3c6423daf865a86dfb9d9d4
رقم الأكسشن: edsdoj.9f30d939a3c6423daf865a86dfb9d9d4
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:25890042
DOI:10.1016/j.isci.2022.105305