PMID: 20075913

Genome Sequence of the Palaeopolyploid Soybean

Abstract

Soybean (Glycine max) is one of the most important crop plants for seed protein and oil content, and for its capacity to fix atmospheric nitrogen through symbioses with soil-borne microorganisms. We sequenced the 1.1-gigabase genome by a whole-genome shotgun approach and integrated it with physical and high-density genetic maps to create a chromosome-scale draft sequence assembly. We predict 46,430 protein-coding genes, 70% more than Arabidopsis and similar to the poplar genome which, like soybean, is an ancient polyploid (palaeopolyploid). About 78% of the predicted genes occur in chromosome ends, which comprise less than one-half of the genome but account for nearly all of the genetic recombination. Genome duplications occurred at approximately 59 and 13 million years ago, resulting in a highly duplicated genome with nearly 75% of the genes present in multiple copies. The two duplication events were followed by gene diversification and loss, and numerous chromosome rearrangements. An accurate soybean genome sequence will facilitate the identification of the genetic basis of many soybean traits, and accelerate the creation of improved soybean varieties.

Citing Articles

A multidisciplinary and integrative review of the structural genome and epigenome of Capsicum L. species.

de Almeida B, Clarindo W. Planta. 2025; 261(4):82.

PMID: 40057910 DOI: 10.1007/s00425-025-04653-w.


Divergent evolutionary paces among eudicot plants revealed by simultaneously duplicated genes produced billions of years ago.

Wang Y, Wang J, Li Y, Jin Y, Wang X. Front Plant Sci. 2025; 16:1518981.

PMID: 40041022 PMC: 11876125. DOI: 10.3389/fpls.2025.1518981.


The genomic landscape of gene-level structural variations in Japanese and global soybean Glycine max cultivars.

Yano R, Li F, Hiraga S, Takeshima R, Kobayashi M, Toda K. Nat Genet. 2025.

PMID: 40033060 DOI: 10.1038/s41588-025-02113-5.


The Genome of the Lima Bean Variety Baiyu Bean Highlights Its Evolutionary Characteristics.

Li F, Liu J, Dewer Y, Ahsan M, Wu C. Ecol Evol. 2025; 15(3):e71027.

PMID: 40027412 PMC: 11868737. DOI: 10.1002/ece3.71027.


Genome-Wide Identification and Expression Analyses of Genes During Nodule Symbiosis in .

Li R, Gou C, Zhang K, He M, Li L, Kong F. Int J Mol Sci. 2025; 26(4).

PMID: 40004114 PMC: 11855358. DOI: 10.3390/ijms26041649.

