Variant Calling and Benchmarking in an Era of Complete Human Genome Sequences

Overview
Journal: Nat Rev Genet
Specialty: Genetics
Date: 2023 Apr 14
PMID: 37059810
Abstract

Genetic variant calling from DNA sequencing has enabled understanding of germline variation in hundreds of thousands of humans. Sequencing technologies and variant-calling methods have advanced rapidly, routinely providing reliable variant calls in most of the human genome. We describe how advances in long reads, deep learning, de novo assembly and pangenomes have expanded access to variant calls in increasingly challenging, repetitive genomic regions, including medically relevant regions, and how new benchmark sets and benchmarking methods illuminate their strengths and limitations. Finally, we explore the possible future of more complete characterization of human genome variation in light of the recent completion of a telomere-to-telomere human genome reference assembly and human pangenomes, and we consider the innovations needed to benchmark their newly accessible repetitive regions and complex variants.

Citing Articles

Systematic benchmarking of tools for structural variation detection using short- and long-read sequencing data in pigs.

He S, Song B, Tang Y, Qu X, Li X, Yang X. iScience. 2025; 28(3):111983.

PMID: 40060913 PMC: 11889634. DOI: 10.1016/j.isci.2025.111983.


Integration of proteomics profiling data to facilitate discovery of cancer neoantigens: a survey.

Luo S, Peng H, Shi Y, Cai J, Zhang S, Shao N. Brief Bioinform. 2025; 26(2).

PMID: 40052441 PMC: 11886573. DOI: 10.1093/bib/bbaf087.


Chromosome-Level Genome Assembly of the Meishan Pig and Insights into Its Domestication Mechanisms.

Du H, Hu J, Zhang Z, Wu Z. Animals (Basel). 2025; 15(4).

PMID: 40003085 PMC: 11851914. DOI: 10.3390/ani15040603.


Advancing long-read nanopore genome assembly and accurate variant calling for rare disease detection.

Negi S, Stenton S, Berger S, Canigiula P, McNulty B, Violich I. Am J Hum Genet. 2025; 112(2):428-449.

PMID: 39862869 PMC: 11866955. DOI: 10.1016/j.ajhg.2025.01.002.


Quantitative Analysis of Pseudogene-Associated Errors During Germline Variant Calling.

Podvalnyi A, Kopernik A, Sayganova M, Woroncow M, Zobkova G, Smirnova A. Int J Mol Sci. 2025; 26(1).

PMID: 39796219 PMC: 11719938. DOI: 10.3390/ijms26010363.

