دورية أكاديمية

Epiclomal: Probabilistic clustering of sparse single-cell DNA methylation data

التفاصيل البيبلوغرافية
العنوان: Epiclomal: Probabilistic clustering of sparse single-cell DNA methylation data
المؤلفون: Camila P. E. de Souza, Mirela Andronescu, Tehmina Masud, Farhia Kabeer, Justina Biele, Emma Laks, Daniel Lai, Patricia Ye, Jazmine Brimhall, Beixi Wang, Edmund Su, Tony Hui, Qi Cao, Marcu
المصدر: Public Library of Science, PLOS Computational Biology. 16(9):1-24
سنة النشر: 2020
الوصف: We present Epiclomal, a probabilistic clustering method arising from a hierarchical mixture model to simultaneously cluster sparse single-cell DNA methylation data and impute missing values. Using synthetic and published single-cell CpG datasets, we show that Epiclomal outperforms non-probabilistic methods and can handle the inherent missing data characteristic that dominates single-cell CpG genome sequences. Using newly generated single-cell 5mCpG sequencing data, we show that Epiclomal discovers sub-clonal methylation patterns in aneuploid tumour genomes, thus defining epiclones that can match or transcend copy number-determined clonal lineages and opening up an important form of clonal analysis in cancer. Epiclomal is written in R and Python and is available at https://github.com/shahcompbio/Epiclomal.Author summary: DNA methylation is an epigenetic mark that occurs when methyl groups are attached to the DNA molecule, thereby playing decisive roles in numerous biological processes. Advances in technology have allowed the generation of high-throughput DNA methylation sequencing data from single cells. One of the goals is to group cells according to their DNA methylation profiles; however, a major challenge arises due to a large amount of missing data per cell. To address this problem, we developed a novel statistical model and framework: Epiclomal. Our approach uses a hierarchical mixture model to borrow statistical strength across cells and neighboring loci to accurately define cell groups (clusters). We compare our approach to different methods on both synthetic and published datasets. We show that Epiclomal is more robust than other approaches, producing more accurate clusters of cells in the majority of experimental scenarios. We also apply Epiclomal to newly generated single-cell DNA methylation data from breast cancer xenografts. Our results show that methylation-based clusters can mirror or in some instances transcend the clusters defined by single-cell cop
نوع الوثيقة: redif-article
اللغة: English
DOI: 10.1371/journal.pcbi.1008
الإتاحة: https://ideas.repec.org/a/plo/pcbi00/1008270.html
رقم الأكسشن: edsrep.a.plo.pcbi00.1008270
قاعدة البيانات: RePEc
الوصف
DOI:10.1371/journal.pcbi.1008