دورية أكاديمية

ExRec: a python pipeline for generating recombination-filtered multi-locus datasets.

التفاصيل البيبلوغرافية
العنوان: ExRec: a python pipeline for generating recombination-filtered multi-locus datasets.
المؤلفون: Potter, Sam McCarthy, Jennings, W Bryan
المصدر: Bioinformatics Advances; 2023, Vol. 3 Issue 1, p1-5, 5p
مصطلحات موضوعية: PHYLOGENY, PYTHON programming language, AUTOMATION, COMPUTATIONAL biology, BIOINFORMATICS
مستخلص: Summary ExRec (Exclusion of Recombined DNA) is a dependency-free Python pipeline that implements the four-gamete test to automatically filter out recombined DNA blocks from thousands of DNA sequence loci. This procedure helps all loci better meet the "no intralocus recombination" assumption common to many coalescent-based analyses in population genomic, phylogeographic, and shallow-scale phylogenomic studies. The user-friendly pipeline contains five standalone applications—four file conversion scripts and one main script that performs the recombination filtering procedures. The pipeline outputs recombination-filtered data in a variety of common formats and a tab-delimited table that displays descriptive statistics for all loci and the analysis results. A novel feature of this software is that the user can select whether to output the longest nonrecombined sequence blocks from recombined loci (current best practice) or randomly select nonrecombined blocks from loci (a newer approach). We tested ExRec with six published phylogenomic datasets that ranged in size from 27 to 2237 loci and came in a variety of input file formats. In all trials the data could be easily analyzed in only seconds for the smaller datasets and <30 min for the largest using a simple laptop computer. Availability and implementation ExRec was written in Python 3 under the MIT license. The program applications, user manual (including step-by-step tutorials), and sample data are freely available at https://github.com/Sammccarthypotter/ExRec. [ABSTRACT FROM AUTHOR]
Copyright of Bioinformatics Advances is the property of Oxford University Press / USA and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
قاعدة البيانات: Complementary Index