» Articles » PMID: 36613666

A Multi-Year, Multi-Cultivar Approach to Differential Expression Analysis of High- and Low-Protein Soybean ()

Abstract

Soybean ( (L.) Merr.) is among the most valuable crops based on its nutritious seed protein and oil. Protein quality, evaluated as the ratio of glycinin (11S) to β-conglycinin (7S), can play a role in food and feed quality. To help uncover the underlying differences between high and low protein soybean varieties, we performed differential expression analysis on high and low total protein soybean varieties and high and low 11S soybean varieties grown in four locations across Eastern and Western Canada over three years (2018-2020). Simultaneously, ten individual differential expression datasets for high vs. low total protein soybeans and ten individual differential expression datasets for high vs. low 11S soybeans were assessed, for a total of 20 datasets. The top 15 most upregulated and the 15 most downregulated genes were extracted from each differential expression dataset and cross-examination was conducted to create shortlists of the most consistently differentially expressed genes. Shortlisted genes were assessed for gene ontology to gain a global appreciation of the commonly differentially expressed genes. Genes with roles in the lipid metabolic pathway and carbohydrate metabolic pathway were differentially expressed in high total protein and high 11S soybeans in comparison to their low total protein and low 11S counterparts. Expression differences were consistent between East and West locations with the exception of one, . These data are important for uncovering the genes and biological pathways responsible for the difference in seed protein between high and low total protein or 11S cultivars.

Citing Articles

Differential gene expression provides leads to environmentally regulated soybean seed protein content.

Hooker J, Smith M, Zapata G, Charette M, Luckert D, Mohr R Front Plant Sci. 2023; 14:1260393.

PMID: 37790790 PMC: 10544915. DOI: 10.3389/fpls.2023.1260393.

References
1.
Anders S, Pyl P, Huber W . HTSeq--a Python framework to work with high-throughput sequencing data. Bioinformatics. 2014; 31(2):166-9. PMC: 4287950. DOI: 10.1093/bioinformatics/btu638. View

2.
Herridge R, Day R, Macknight R . The role of the MCM2-7 helicase complex during Arabidopsis seed development. Plant Mol Biol. 2014; 86(1-2):69-84. DOI: 10.1007/s11103-014-0213-x. View

3.
Lanoix J, Ouwendijk J, Lin C, Stark A, Love H, Ostermann J . GTP hydrolysis by arf-1 mediates sorting and concentration of Golgi resident enzymes into functional COP I vesicles. EMBO J. 1999; 18(18):4935-48. PMC: 1171565. DOI: 10.1093/emboj/18.18.4935. View

4.
Huang S, Yu J, Li Y, Wang J, Wang X, Qi H . Identification of Soybean Genes Related to Soybean Seed Protein Content Based on Quantitative Trait Loci Collinearity Analysis. J Agric Food Chem. 2018; 67(1):258-274. DOI: 10.1021/acs.jafc.8b04602. View

5.
Chern M, Eiben H, Bustos M . The developmentally regulated bZIP factor ROM1 modulates transcription from lectin and storage protein genes in bean embryos. Plant J. 1996; 10(1):135-48. DOI: 10.1046/j.1365-313x.1996.10010135.x. View