» Articles » PMID: 20547594

Multivariate Cutoff Level Analysis (MultiCoLA) of Large Community Data Sets

Overview
Specialty Biochemistry
Date 2010 Jun 16
PMID 20547594
Citations 45
Authors
Affiliations
Soon will be listed here.
Abstract

High-throughput sequencing techniques are becoming attractive to molecular biologists and ecologists as they provide a time- and cost-effective way to explore diversity patterns in environmental samples at an unprecedented resolution. An issue common to many studies is the definition of what fractions of a data set should be considered as rare or dominant. Yet this question has neither been satisfactorily addressed, nor is the impact of such definition on data set structure and interpretation been fully evaluated. Here we propose a strategy, MultiCoLA (Multivariate Cutoff Level Analysis), to systematically assess the impact of various abundance or rarity cutoff levels on the resulting data set structure and on the consistency of the further ecological interpretation. We applied MultiCoLA to a 454 massively parallel tag sequencing data set of V6 ribosomal sequences from marine microbes in temperate coastal sands. Consistent ecological patterns were maintained after removing up to 35-40% rare sequences and similar patterns of beta diversity were observed after denoising the data set by using a preclustering algorithm of 454 flowgrams. This example validates the importance of exploring the impact of the definition of rarity in large community data sets. Future applications can be foreseen for data sets from different types of habitats, e.g. other marine environments, soil and human microbiota.

Citing Articles

Differential roles of deterministic and stochastic processes in structuring soil bacterial ecotypes across terrestrial ecosystems.

Riddley M, Hepp S, Hardeep F, Nayak A, Liu M, Xing X Nat Commun. 2025; 16(1):2337.

PMID: 40057505 PMC: 11890569. DOI: 10.1038/s41467-025-57526-x.


Plant species within Streptanthoid Complex associate with distinct microbial communities that shift to be more similar under drought.

Igwe A, Pearse I, Aguilar J, Strauss S, Vannette R Ecol Evol. 2024; 14(3):e11174.

PMID: 38529025 PMC: 10961476. DOI: 10.1002/ece3.11174.


Gut microbiota non-convergence and adaptations in sympatric Tibetan and Przewalski's gazelles.

Song P, Jiang F, Liu D, Cai Z, Gao H, Gu H iScience. 2024; 27(3):109117.

PMID: 38384851 PMC: 10879710. DOI: 10.1016/j.isci.2024.109117.


Shrub expansion raises both aboveground and underground multifunctionality on a subtropical plateau grassland: coupling multitrophic community assembly to multifunctionality and functional trade-off.

Ding L, Chen H, Wang M, Wang P Front Microbiol. 2024; 14:1339125.

PMID: 38274762 PMC: 10808678. DOI: 10.3389/fmicb.2023.1339125.


Assembly processes and functional diversity of marine protists and their rare biosphere.

Ramond P, Siano R, Sourisseau M, Logares R Environ Microbiome. 2023; 18(1):59.

PMID: 37443126 PMC: 10347826. DOI: 10.1186/s40793-023-00513-w.


References
1.
Venter J, Remington K, Heidelberg J, Halpern A, Rusch D, Eisen J . Environmental genome shotgun sequencing of the Sargasso Sea. Science. 2004; 304(5667):66-74. DOI: 10.1126/science.1093857. View

2.
Kunin V, Engelbrektson A, Ochman H, Hugenholtz P . Wrinkles in the rare biosphere: pyrosequencing errors can lead to artificial inflation of diversity estimates. Environ Microbiol. 2009; 12(1):118-23. DOI: 10.1111/j.1462-2920.2009.02051.x. View

3.
Legendre P, Gallagher E . Ecologically meaningful transformations for ordination of species data. Oecologia. 2017; 129(2):271-280. DOI: 10.1007/s004420100716. View

4.
Qin J, Li R, Raes J, Arumugam M, Burgdorf K, Manichanh C . A human gut microbial gene catalogue established by metagenomic sequencing. Nature. 2010; 464(7285):59-65. PMC: 3779803. DOI: 10.1038/nature08821. View

5.
Galand P, Casamayor E, Kirchman D, Lovejoy C . Ecology of the rare microbial biosphere of the Arctic Ocean. Proc Natl Acad Sci U S A. 2009; 106(52):22427-32. PMC: 2796907. DOI: 10.1073/pnas.0908284106. View