
Large-Scale Analyses of Human Microbiomes Reveal Thousands of Small, Novel Genes

Overview
Journal: Cell
Publisher: Cell Press
Specialty: Cell Biology
Date: 2019 Aug 13
PMID: 31402174
Citations: 110
Abstract

Small proteins are traditionally overlooked due to computational and experimental difficulties in detecting them. To systematically identify small proteins, we carried out a comparative genomics study on 1,773 human-associated metagenomes from four different body sites. We describe >4,000 conserved protein families, the majority of which are novel; ∼30% of these protein families are predicted to be secreted or transmembrane. Over 90% of the small protein families have no known domain, and almost half are not represented in reference genomes. We identify putative housekeeping, mammalian-specific, and defense-related protein families, as well as protein families that are likely to be horizontally transferred. We provide evidence of transcription and translation for a subset of these families. Our study suggests that small proteins are highly abundant and that those of the human microbiome, in particular, may perform diverse functions that have not been previously reported.

Citing Articles

The hidden bacterial microproteome.

Fesenko I, Sahakyan H, Dhyani R, Shabalina S, Storz G, Koonin E. Mol Cell. 2025; 85(5):1024-1041.e6.

PMID: 39978337 PMC: 11890958. DOI: 10.1016/j.molcel.2025.01.025.


sORFdb - a database for sORFs, small proteins, and small protein families in bacteria.

Hahnfeld J, Schwengers O, Jelonek L, Diedrich S, Cemic F, Goesmann A. BMC Genomics. 2025; 26(1):110.

PMID: 39910485 PMC: 11796252. DOI: 10.1186/s12864-025-11301-w.


Mining microbiomes for microproteins.

Neville B, Lawley T. Nat Rev Microbiol. 2025; 23(3):146.

PMID: 39885330 DOI: 10.1038/s41579-025-01154-1.


Deciphering the role of host-gut microbiota crosstalk via diverse sources of extracellular vesicles in colorectal cancer.

Song Y, Shi M, Wang Y. Mol Med. 2024; 30(1):200.

PMID: 39501131 PMC: 11536884. DOI: 10.1186/s10020-024-00976-8.


Assessing fecal metaproteomics workflow and small protein recovery using DDA and DIA PASEF mass spectrometry.

Wang A, Fekete E, Creskey M, Cheng K, Ning Z, Pfeifle A. Microbiome Res Rep. 2024; 3(3):39.

PMID: 39421247 PMC: 11480776. DOI: 10.20517/mrr.2024.21.

