Academic Journal

Correction of transposase sequence bias in ATAC-seq data with rule ensemble modeling.

Bibliographic Details
Title: Correction of transposase sequence bias in ATAC-seq data with rule ensemble modeling.
Authors: Wolpe JB; Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA, USA., Martins AL; Center for Cell Analysis and Modeling, University of Connecticut, Farmington, CT, USA.; Department of Genetics and Genome Sciences, University of Connecticut, Farmington, CT, USA., Guertin MJ; Center for Cell Analysis and Modeling, University of Connecticut, Farmington, CT, USA.; Department of Genetics and Genome Sciences, University of Connecticut, Farmington, CT, USA.
Source: NAR genomics and bioinformatics [NAR Genom Bioinform] 2023 Jun 02; Vol. 5 (2), pp. lqad054. Date of Electronic Publication: 2023 Jun 02 (Print Publication: 2023).
Publication Type: Journal Article
Language: English
Journal Info: Publisher: Oxford University Press; Country of Publication: England; NLM ID: 101756213; Publication Model: eCollection; Cited Medium: Internet; ISSN: 2631-9268 (Electronic); Linking ISSN: 26319268; NLM ISO Abbreviation: NAR Genom Bioinform; Subsets: PubMed not MEDLINE
Imprint Name(s): Original Publication: [Oxford] : Oxford University Press, [2019]-
Abstract: Chromatin accessibility assays have revolutionized the field of transcription regulation by providing single-nucleotide resolution measurements of regulatory features such as promoters and transcription factor binding sites. ATAC-seq directly measures how well the Tn5 transposase accesses chromatinized DNA. Tn5 has a complex sequence bias that is not effectively scaled with traditional bias-correction methods. We model this complex bias using a rule ensemble machine learning approach that integrates information from many input k-mers proximal to the ATAC sequence reads. We effectively characterize and correct single-nucleotide sequence biases and regional sequence biases of the Tn5 enzyme. Correction of enzymatic sequence bias is an important step in interpreting chromatin accessibility assays that aim to infer transcription factor binding and regulatory activity of elements in the genome.
(© The Author(s) 2023. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics.)
Grant Information: R35 GM128635 United States GM NIGMS NIH HHS
Entry Date(s): Date Created: 20230605; Latest Revision: 20230919
Update Code: 20231215
PubMed Central ID: PMC10236359
DOI: 10.1093/nargab/lqad054
PMID: 37274120
Database: MEDLINE