Best Practices in Designing, Sequencing and Identifying Random DNA Barcodes

التفاصيل البيبلوغرافية
العنوان: Best Practices in Designing, Sequencing and Identifying Random DNA Barcodes
المؤلفون: Johnson, Milo, Venkataram, Sandeep, Kryazhimskiy, Sergey
بيانات النشر: Zenodo, 2022.
سنة النشر: 2022
مصطلحات موضوعية: bepress|Life Sciences, bepress|Life Sciences|Biotechnology, Genetics, DNA barcode, bepress|Life Sciences|Bioinformatics, bepress|Life Sciences|Cell and Developmental Biology, bepress|Life Sciences|Ecology and Evolutionary Biology, Molecular Biology, bepress|Life Sciences|Ecology and Evolutionary Biology|Evolution, Ecology, Evolution, Behavior and Systematics
الوصف: Random DNA barcodes are a versatile tool for tracking cell lineages, with applications ranging from development to cancer to evolution. Here we review and critically evaluate barcode designs as well as methods of barcode sequencing and initial processing of barcode data. We first demonstrate how various barcode design decisions affect data quality and propose a new optimal design that balances all considerations that we are currently aware of. We then discuss various options for the preparation of barcode sequencing libraries, including inline indices and Unique Molecular Identifiers (UMIs). Our main conclusion is that the utility of inline indices is high whereas that of UMIs is low. Finally, we test the performance of several established and new bioinformatic pipelines for the extraction of barcodes from raw sequencing reads and for error correction. We find that both alignment and regular expression-based approaches work well for barcode extraction, and that error correction pipelines designed specifically for barcode data are superior to generic ones. Overall, this review will help researchers approach their barcoding experiments in a deliberate and systematic way.
DOI: 10.5281/zenodo.7411747
URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::bd122fdcb32c807e3a9699953104fc03
حقوق: OPEN
رقم الأكسشن: edsair.doi.dedup.....bd122fdcb32c807e3a9699953104fc03
قاعدة البيانات: OpenAIRE