PMID: 35616118

GrainGenes: a Data-rich Repository for Small Grains Genetics and Genomics

Abstract

As one of the US Department of Agriculture-Agricultural Research Service flagship databases, GrainGenes (https://wheat.pw.usda.gov) serves the data and community needs of globally distributed small grains researchers for the genetic improvement of the Triticeae family and Avena species that include wheat, barley, rye and oat. GrainGenes accomplishes its mission by continually enriching its cross-linked data content following the findable, accessible, interoperable and reusable principles, enhancing and maintaining an intuitive web interface, creating tools to enable easy data access and establishing data connections within and between GrainGenes and other biological databases to facilitate knowledge discovery. GrainGenes operates within the biological database community, collaborates with curators and genome sequencing groups and contributes to the AgBioData Consortium and the International Wheat Initiative through the Wheat Information System (WheatIS). Interactive and linked content is paramount for successful biological databases and GrainGenes now has 2917 manually curated gene records, including 289 genes and 254 alleles from the Wheat Gene Catalogue (WGC). There are >4.8 million gene models in 51 genome browser assemblies, 6273 quantitative trait loci and >1.4 million genetic loci on 4756 genetic and physical maps contained within 443 mapping sets, complete with standardized metadata. Most notably, 50 new genome browsers that include outputs from the Wheat and Barley PanGenome projects have been created. We provide an example of an expression quantitative trait loci track on the International Wheat Genome Sequencing Consortium Chinese Spring wheat browser to demonstrate how genome browser tracks can be adapted for different data types. To help users benefit more from its data, GrainGenes created four tutorials available on YouTube. 
GrainGenes pursues its vision of service by continuously responding to the needs of the global small grains community through a centralized, long-term, interconnected data repository. Database URL: https://wheat.pw.usda.gov.

Citing Articles

A large-scale multi-environment study dissecting adult-plant resistance haplotypes for stripe rust resistance in Australian wheat breeding populations.

Vo Van-Zivkovic N, Dinglasan E, Tong J, Watt C, Goody J, Mullan D. Theor Appl Genet. 2025; 138(4):72.

PMID: 40080143 PMC: 11906565. DOI: 10.1007/s00122-025-04859-2.


Assessing the performance of generative artificial intelligence in retrieving information against manually curated genetic and genomic data.

Poretsky E, Blake V, Andorf C, Sen T. Database (Oxford). 2025; 2025.

PMID: 39963877 PMC: 11833239. DOI: 10.1093/database/baaf011.


Building resource-efficient community databases using open-source software.

Jung S, Cheng C, Lee T, Buble K, Humann J, Zheng P. Database (Oxford). 2025; 2025.

PMID: 39937662 PMC: 11833237. DOI: 10.1093/database/baaf005.


AutoXAI4Omics: an automated explainable AI tool for omics and tabular data.

Strudwick J, Gardiner L, Denning-James K, Haiminen N, Evans A, Kelly J. Brief Bioinform. 2024; 26(1).

PMID: 39576223 PMC: 11583442. DOI: 10.1093/bib/bbae593.


Identification of Genomic Regions Conferring Enhanced Zn and Fe Concentration in Wheat Varieties and Introgression Lines Derived from Wild Relatives.

Leonova I, Kiseleva A, Salina E. Int J Mol Sci. 2024; 25(19).

PMID: 39408887 PMC: 11477371. DOI: 10.3390/ijms251910556.

