
Protein-Coding Genes of Helicobacter pylori Predominantly Present Purifying Selection Though Many Membrane Proteins Suffer from Selection Pressure: A Proposal to Analyze Bacterial Pangenomes

Overview
Journal: Genes (Basel)
Publisher: MDPI
Date: 2021 Apr 3
PMID: 33800844
Citations: 2
Abstract

The availability of complete genome sequences has shown that bacterial genomes can carry genes that are not present in every strain of a given species. The genes shared by all strains make up the core of the species, but the pangenome can be much larger and usually includes genes that appear in only one strain. Once the pangenome of a species has been estimated, further studies can be undertaken, such as analyzing the evolutionary selection acting on protein-coding genes. Most genes in a pangenome are expected to be subject to purifying selection, which ensures conservation of function, especially those in the core group. However, some genes can be subject to selection pressure, such as virulence genes that need to escape the host immune system; this is more common in the accessory group of the pangenome. We analyzed 180 strains of Helicobacter pylori, a bacterium that colonizes the gastric mucosa of half the world population and has a low number of genes (around 1500 in a strain and 3000 in the pangenome). After estimating the pangenome, we calculated the evolutionary selection for each gene and found that 85% of them are subject to purifying selection, while the remaining genes show some degree of selection pressure. As expected, the latter group is enriched in genes encoding membrane proteins putatively involved in interaction with host tissues. This group also contains a high number of uncharacterized genes and genes encoding putative spurious proteins, which suggests that they could be false positives from the gene finders used to identify them. Together, these results suggest that this kind of analysis can be useful for validating gene predictions and functionally characterizing proteins in complete genomes.
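
The core/accessory distinction and the per-gene selection call described in the abstract can be illustrated with a small sketch. This is not the authors' pipeline: the abstract does not say how selection was measured, so the example assumes each gene family is summarized by a dN/dS ratio (here called `omega`) and uses the conventional threshold of 1. The function names, strain labels, gene labels, and values are all hypothetical toy data.

```python
from typing import Dict, Set, Tuple


def partition_pangenome(
    strain_genes: Dict[str, Set[str]]
) -> Tuple[Set[str], Set[str], Set[str], Set[str]]:
    """Return (pangenome, core, accessory, strain-specific) gene families."""
    gene_sets = list(strain_genes.values())
    if not gene_sets:
        raise ValueError("at least one strain is required")
    pangenome = set().union(*gene_sets)      # every family seen in any strain
    core = set.intersection(*gene_sets)      # families present in all strains
    accessory = pangenome - core             # everything else
    # Families that occur in exactly one strain
    unique = {g for g in accessory if sum(g in s for s in gene_sets) == 1}
    return pangenome, core, accessory, unique


def classify_selection(omega: float) -> str:
    """Naive call from an assumed dN/dS ratio; real analyses add statistical tests."""
    if omega < 1.0:
        return "purifying"
    if omega > 1.0:
        return "positive"
    return "neutral"


# Hypothetical toy input: three strains and their gene families.
strains = {
    "strainA": {"g1", "g2", "g3"},
    "strainB": {"g1", "g2", "g4"},
    "strainC": {"g1", "g5"},
}
pangenome, core, accessory, unique = partition_pangenome(strains)

# Hypothetical dN/dS values for four gene families.
omegas = {"g1": 0.10, "g2": 0.30, "g3": 1.80, "g4": 0.05}
calls = {gene: classify_selection(w) for gene, w in omegas.items()}
purifying_fraction = sum(c == "purifying" for c in calls.values()) / len(calls)
```

With real data, the strain-to-gene mapping would come from a pangenome tool's presence/absence output, and the fraction of genes called "purifying" would be the quantity comparable to the 85% reported in the abstract.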

Citing Articles

The most exposed regions of SARS-CoV-2 structural proteins are subject to strong positive selection and gene overlap may locally modify this behavior.

Rubio A, de Toro M, Perez-Pulido A. mSystems. 2023; 9(1):e0071323.

PMID: 38095866 PMC: 10804949. DOI: 10.1128/msystems.00713-23.


Comparative genomics of the genus reveals widespread diversity in genomic content and positive selection history.

Batarseh T, Batarseh S, Morales-Cruz A, Gaut B. Front Microbiol. 2023; 14:1206094.

PMID: 37434713 PMC: 10330825. DOI: 10.3389/fmicb.2023.1206094.
