» Articles » PMID: 39461939

DGCNN Approach Links Metagenome-derived Taxon and Functional Information Providing Insight into Global Soil Organic Carbon

Overview
Date 2024 Oct 27
PMID 39461939
Authors
Affiliations
Soon will be listed here.
Abstract

Metagenomics can provide insight into the microbial taxa present in a sample and, through gene identification, the functional potential of the community. However, taxonomic and functional information are typically considered separately in downstream analyses. We develop interpretable machine learning (ML) approaches for modelling metagenomic data, combining the biological representation of species with their associated genetically encoded functions within models. We apply our methods to investigate soil organic carbon (SOC) stocks. First, we combine a diverse global set of soil microbiome samples with environmental data, improving the predictive performance of classic ML and providing new insights into the role of soil microbiomes in global carbon cycling. Our network analysis of predictive taxa identified by classical ML models provides context for their ecological significance, extending the focus beyond just the most predictive taxa to 'hidden' features within the model that might be considered less predictive using standard methods for explainability. We next develop unique graph representations for individual microbiomes, linking microbial taxa to their associated functions directly, enabling predictions of SOC via deep graph convolutional neural networks (DGCNNs). Interpretation of the DGCNNs distinguished between the importance of functions of key individual species, providing genome sequence differences, e.g., gene loss/acquisition, that associate with SOC. These approaches identify several members of the Verrucomicrobiaceae family and a range of genetically encoded functions, e.g., related to carbohydrate metabolism, as important for SOC stocks and effective global SOC predictors. These relatively understudied but widespread organisms could play an important role in SOC dynamics globally.

References
1.
Piton G, Allison S, Bahram M, Hildebrand F, Martiny J, Treseder K . Life history strategies of soil bacterial communities across global terrestrial biomes. Nat Microbiol. 2023; 8(11):2093-2102. DOI: 10.1038/s41564-023-01465-0. View

2.
Bunger W, Jiang X, Muller J, Hurek T, Reinhold-Hurek B . Novel cultivated endophytic Verrucomicrobia reveal deep-rooting traits of bacteria to associate with plants. Sci Rep. 2020; 10(1):8692. PMC: 7251102. DOI: 10.1038/s41598-020-65277-6. View

3.
Reiman D, Metwally A, Dai Y . Using convolutional neural networks to explore the microbiome. Annu Int Conf IEEE Eng Med Biol Soc. 2017; 2017:4269-4272. DOI: 10.1109/EMBC.2017.8037799. View

4.
Strickland M, Lauber C, Fierer N, Bradford M . Testing the functional significance of microbial community composition. Ecology. 2009; 90(2):441-51. DOI: 10.1890/08-0296.1. View

5.
Lal R, Monger C, Nave L, Smith P . The role of soil in regulation of climate. Philos Trans R Soc Lond B Biol Sci. 2021; 376(1834):20210084. PMC: 8349633. DOI: 10.1098/rstb.2021.0084. View