gFlora: a topology-aware method to discover functional co-response groups in soil microbial communities

التفاصيل البيبلوغرافية
العنوان: gFlora: a topology-aware method to discover functional co-response groups in soil microbial communities
المؤلفون: Chen, Nan, Schram, Merlijn, Bucur, Doina
سنة النشر: 2024
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Machine Learning, Computer Science - Neural and Evolutionary Computing
الوصف: We aim to learn the functional co-response group: a group of taxa whose co-response effect (the representative characteristic of the group showing the total topological abundance of taxa) co-responds (associates well statistically) to a functional variable. Different from the state-of-the-art method, we model the soil microbial community as an ecological co-occurrence network with the taxa as nodes (weighted by their abundance) and their relationships (a combination from both spatial and functional ecological aspects) as edges (weighted by the strength of the relationships). Then, we design a method called gFlora which notably uses graph convolution over this co-occurrence network to get the co-response effect of the group, such that the network topology is also considered in the discovery process. We evaluate gFlora on two real-world soil microbiome datasets (bacteria and nematodes) and compare it with the state-of-the-art method. gFlora outperforms this on all evaluation metrics, and discovers new functional evidence for taxa which were so far under-studied. We show that the graph convolution step is crucial to taxa with relatively low abundance (thus removing the bias towards taxa with higher abundance), and the discovered bacteria of different genera are distributed in the co-occurrence network but still tightly connected among themselves, demonstrating that topologically they fill different but collaborative functional roles in the ecological community.
Comment: BIOKDD accepted. Note: the first version of this paper is not approved by all authors. the second version is the official version for BIOKDD camera-ready
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2407.03897
رقم الأكسشن: edsarx.2407.03897
قاعدة البيانات: arXiv