Sparse dictionary learning recovers pleiotropy from human cell fitness screens

التفاصيل البيبلوغرافية
العنوان: Sparse dictionary learning recovers pleiotropy from human cell fitness screens
المؤلفون: Pan, Joshua, Kwon, Jason J., Talamas, Jessica A., Borah, Ashir A., Vazquez, Francisca, Boehm, Jesse S., Tsherniak, Aviad, Zitnik, Marinka, McFarland, James M., Hahn, William C.
سنة النشر: 2021
المجموعة: Quantitative Biology
مصطلحات موضوعية: Quantitative Biology - Quantitative Methods, Quantitative Biology - Genomics, Quantitative Biology - Molecular Networks
الوصف: In high-throughput functional genomic screens, each gene product is commonly assumed to exhibit a singular biological function within a defined protein complex or pathway. In practice, a single gene perturbation may induce multiple cascading functional outcomes, a genetic principle known as pleiotropy. Here, we model pleiotropy in fitness screen collections by representing each gene perturbation as the sum of multiple perturbations of biological functions, each harboring independent fitness effects inferred empirically from the data. Our approach ('Webster') recovered pleiotropic functions for DNA damage proteins from genotoxic fitness screens, untangled distinct signaling pathways upstream of shared effector proteins from cancer cell fitness screens, and learned aspects of the cellular hierarchy in an unsupervised manner. Modeling compound sensitivity profiles in terms of genetically defined functions recovered compound mechanisms of action. Our approach establishes a sparse approximation mechanism for unraveling complex genetic architectures underlying high-dimensional gene perturbation readouts.
Comment: Accepted to the 16th Machine Learning in Computational Biology (MLCB) meeting 2021, and the Learning Meaningful Representations of Life (LMRL) Workshop at NeurIPS 2021
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2111.06247
رقم الأكسشن: edsarx.2111.06247
قاعدة البيانات: arXiv