Academic Journal

Cooltools: Enabling high-resolution Hi-C analysis in Python.

Bibliographic Details
Title: Cooltools: Enabling high-resolution Hi-C analysis in Python.
Authors: Abdennur N; Department of Genomics and Computational Biology, University of Massachusetts Chan Medical School, Worcester, Massachusetts, United States of America.; Department of Systems Biology, University of Massachusetts Chan Medical School, Worcester, Massachusetts, United States of America., Abraham S; Institute for Medical Engineering and Sciences, Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts, United States of America., Fudenberg G; Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, California, United States of America., Flyamer IM; Friedrich Miescher Institute for Biomedical Research, Basel, Switzerland., Galitsyna AA; Institute for Medical Engineering and Sciences, Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts, United States of America., Goloborodko A; Institute of Molecular Biotechnology of the Austrian Academy of Sciences (IMBA), Vienna BioCenter (VBC), Vienna, Austria., Imakaev M; Institute for Medical Engineering and Sciences, Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts, United States of America., Oksuz BA; Department of Systems Biology, University of Massachusetts Chan Medical School, Worcester, Massachusetts, United States of America., Venev SV; Department of Systems Biology, University of Massachusetts Chan Medical School, Worcester, Massachusetts, United States of America., Xiao Y; Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, California, United States of America.
Corporate Authors: Open2C
Source: PLoS computational biology [PLoS Comput Biol] 2024 May 06; Vol. 20 (5), pp. e1012067. Date of Electronic Publication: 2024 May 06 (Print Publication: 2024).
Publication Type: Journal Article
Language: English
Journal Info: Publisher: Public Library of Science; Country of Publication: United States; NLM ID: 101238922; Publication Model: eCollection; Cited Medium: Internet; ISSN: 1553-7358 (Electronic); Linking ISSN: 1553-734X; NLM ISO Abbreviation: PLoS Comput Biol; Subsets: MEDLINE
Imprint Name(s): Original Publication: San Francisco, CA : Public Library of Science, [2005]-
MeSH Terms: Software*; Computational Biology*/methods; Programming Languages; Genomics/methods; Genome/genetics; Chromosome Mapping/methods; Humans
Abstract: Chromosome conformation capture (3C) technologies reveal the incredible complexity of genome organization. Maps of increasing size, depth, and resolution are now used to probe genome architecture across cell states, types, and organisms. Larger datasets add challenges at each step of computational analysis, from storage and memory constraints to researchers' time; however, analysis tools that meet these increased resource demands have not kept pace. Furthermore, existing tools offer limited support for customizing analysis for specific use cases or new biology. Here we introduce cooltools (https://github.com/open2c/cooltools), a suite of computational tools that enables flexible, scalable, and reproducible analysis of high-resolution contact frequency data. Cooltools leverages the widely adopted cooler format, which handles storage and access for high-resolution datasets. Cooltools provides a paired command line interface (CLI) and Python application programming interface (API), which respectively facilitate workflows on high-performance computing clusters and in interactive analysis environments. In short, cooltools enables the effective use of the latest and largest genome folding datasets.
Competing Interests: The authors have declared that no competing interests exist.
(Copyright: © 2024 Open2C et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.)
Grant Information: R01 HG003143 United States HG NHGRI NIH HHS; R35 GM143116 United States GM NIGMS NIH HHS; UM1 HG011536 United States HG NHGRI NIH HHS
Entry Dates: Date Created: 20240506; Date Completed: 20240516; Latest Revision: 20240518
Update Code: 20240518
PubMed Central ID: PMC11098495
DOI: 10.1371/journal.pcbi.1012067
PMID: 38709825
Database: MEDLINE