PMID: 35444317

The Human Pangenome Project: a Global Resource to Map Genomic Diversity

Abstract

The human reference genome is the most widely used resource in human genetics and is due for a major update. Its current structure is a linear composite of merged haplotypes from more than 20 people, with a single individual comprising most of the sequence. It contains biases and errors within a framework that does not represent global human genomic variation. A high-quality reference with global representation of common variants, including single-nucleotide variants, structural variants and functional elements, is needed. The Human Pangenome Reference Consortium aims to create a more sophisticated and complete human reference genome with a graph-based, telomere-to-telomere representation of global genomic diversity. Here we leverage innovations in technology, study design and global partnerships with the goal of constructing the highest-possible quality human pangenome reference. Our goal is to improve data representation and streamline analyses to enable routine assembly of complete diploid genomes. With attention to ethical frameworks, the human pangenome reference will contain a more accurate and diverse representation of global genomic variation, improve gene-disease association studies across populations, expand the scope of genomics research to the most repetitive and polymorphic regions of the genome, and serve as the ultimate genetic resource for future biomedical research and precision medicine.

Citing Articles

Improving genetic variant identification for quantitative traits using ensemble learning-based approaches.

Sharma J, Jangale V, Shekhawat R, Yadav P BMC Genomics. 2025; 26(1):237.

PMID: 40075256 PMC: 11899862. DOI: 10.1186/s12864-025-11443-x.


Equitable machine learning counteracts ancestral bias in precision medicine.

Smith L, Cahill J, Lee J, Graim K Nat Commun. 2025; 16(1):2144.

PMID: 40064867 PMC: 11894161. DOI: 10.1038/s41467-025-57216-8.


Genome-wide profiling of highly similar paralogous genes using HiFi sequencing.

Chen X, Baker D, Dolzhenko E, Devaney J, Noya J, Berlyoung A Nat Commun. 2025; 16(1):2340.

PMID: 40057485 PMC: 11890787. DOI: 10.1038/s41467-025-57505-2.


Evolution, genetic diversity, and health.

Palma-Martinez M, Posadas-Garcia Y, Shaukat A, Lopez-Angeles B, Sohail M Nat Med. 2025.

PMID: 40055519 DOI: 10.1038/s41591-025-03558-1.


Mem-based pangenome indexing for k-mer queries.

Hwang S, Brown N, Ahmed O, Jenike K, Kovaka S, Schatz M Algorithms Mol Biol. 2025; 20(1):3.

PMID: 40025556 PMC: 11871630. DOI: 10.1186/s13015-025-00272-y.

