» Articles » PMID: 27517583

A Systematic Analysis of the Structures of Heterologously Expressed Proteins and Those from Their Native Hosts in the RCSB PDB Archive

Overview
Journal PLoS One
Date 2016 Aug 13
PMID 27517583
Citations 4
Authors
Affiliations
Soon will be listed here.
Abstract

Recombinant expression of proteins has become an indispensable tool in modern day research. The large yields of recombinantly expressed proteins accelerate the structural and functional characterization of proteins. Nevertheless, there are literature reported that the recombinant proteins show some differences in structure and function as compared with the native ones. Now there have been more than 100,000 structures (from both recombinant and native sources) publicly available in the Protein Data Bank (PDB) archive, which makes it possible to investigate if there exist any proteins in the RCSB PDB archive that have identical sequence but have some difference in structures. In this paper, we present the results of a systematic comparative study of the 3D structures of identical naturally purified versus recombinantly expressed proteins. The structural data and sequence information of the proteins were mined from the RCSB PDB archive. The combinatorial extension (CE), FATCAT-flexible and TM-Align methods were employed to align the protein structures. The root-mean-square distance (RMSD), TM-score, P-value, Z-score, secondary structural elements and hydrogen bonds were used to assess the structure similarity. A thorough analysis of the PDB archive generated five-hundred-seventeen pairs of native and recombinant proteins that have identical sequence. There were no pairs of proteins that had the same sequence and significantly different structural fold, which support the hypothesis that expression in a heterologous host usually could fold correctly into their native forms.

Citing Articles

Thermotolerance Mechanism of Fungal GH6 Cellobiohydrolase. Part II. Structural Analysis of Thermotolerant Mutant from the Basidiomycete .

Yamaguchi S, Sunagawa N, Samejima M, Igarashi K J Appl Glycosci (1999). 2024; 71(2):63-72.

PMID: 38863950 PMC: 11163327. DOI: 10.5458/jag.jag.JAG-2023_0018.


Recombinant expression of insoluble enzymes in Escherichia coli: a systematic review of experimental design and its manufacturing implications.

Mital S, Christie G, Dikicioglu D Microb Cell Fact. 2021; 20(1):208.

PMID: 34717620 PMC: 8557517. DOI: 10.1186/s12934-021-01698-w.


Soluble Expression and Catalytic Properties of Codon-Optimized Recombinant Bromelain from MD2 Pineapple in Escherichia coli.

Razali R, Budiman C, Kamaruzaman K, Subbiah V Protein J. 2021; 40(3):406-418.

PMID: 33713245 DOI: 10.1007/s10930-021-09974-9.


Refolding and characterization of two G protein-coupled receptors purified from E. coli inclusion bodies.

Heim B, Handrick R, Hartmann M, Kiefer H PLoS One. 2021; 16(2):e0247689.

PMID: 33626080 PMC: 7904181. DOI: 10.1371/journal.pone.0247689.

References
1.
Ye Y, Godzik A . FATCAT: a web server for flexible structure comparison and structure similarity searching. Nucleic Acids Res. 2004; 32(Web Server issue):W582-5. PMC: 441568. DOI: 10.1093/nar/gkh430. View

2.
Zacharias J, Knapp E . Protein secondary structure classification revisited: processing DSSP information with PSSC. J Chem Inf Model. 2014; 54(7):2166-79. DOI: 10.1021/ci5000856. View

3.
Wurm F . Production of recombinant protein therapeutics in cultivated mammalian cells. Nat Biotechnol. 2004; 22(11):1393-8. DOI: 10.1038/nbt1026. View

4.
Hua L, Liu Y, Zhen S, Wan D, Cao J, Gao X . Expression and biochemical characterization of recombinant human epididymis protein 4. Protein Expr Purif. 2014; 102:52-62. DOI: 10.1016/j.pep.2014.08.004. View

5.
Berman H, Westbrook J, Feng Z, Gilliland G, Bhat T, Weissig H . The Protein Data Bank. Nucleic Acids Res. 1999; 28(1):235-42. PMC: 102472. DOI: 10.1093/nar/28.1.235. View