Benchmarking Computational Variant Effect Predictors by Their Ability to Infer Human Traits

Overview
Journal Genome Biol
Specialties Biology, Genetics
Date 2024 Jul 2
PMID 38951922
Abstract

Background: Computational variant effect predictors offer a scalable and increasingly reliable means of interpreting human genetic variation, but concerns of circularity and bias have limited previous methods for evaluating and comparing predictors. Population-level cohorts of genotyped and phenotyped participants that have not been used in predictor training can facilitate an unbiased benchmarking of available methods. Using a curated set of human gene-trait associations with a reported rare-variant burden association, we evaluate the correlations of 24 computational variant effect predictors with associated human traits in the UK Biobank and All of Us cohorts.
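As a rough illustration of the evaluation idea described in the Background, the sketch below correlates each predictor's variant scores with a quantitative trait among carriers of rare missense variants, then orders predictors by the strength of that correlation. The abstract does not state which statistic the authors used, so the Pearson correlation, the ranking by absolute value, the predictor names, and all numbers here are assumptions made purely for illustration.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation between two equal-length numeric sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_x * var_y)


# Hypothetical toy data: one trait value per participant carrying a rare
# missense variant in a trait-associated gene (not real cohort data).
trait = [1.2, 0.8, 2.5, 3.1, 0.5]

# Hypothetical predictor scores for the variant carried by each participant.
scores = {
    "predictor_a": [0.2, 0.1, 0.7, 0.9, 0.05],
    "predictor_b": [0.5, 0.6, 0.4, 0.5, 0.7],
}

# Assumption: rank by absolute correlation, since the direction of effect
# depends on the trait and on each predictor's score convention.
correlations = {name: pearson(vals, trait) for name, vals in scores.items()}
ranked = sorted(correlations, key=lambda name: abs(correlations[name]), reverse=True)
```

A real analysis would work per gene-trait pair, handle binary traits separately, and estimate uncertainty (for example by resampling participants); none of that is shown here.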

Results: AlphaMissense outperformed all other predictors in inferring human traits based on rare missense variants in UK Biobank and All of Us participants. The overall rankings of computational variant effect predictors in these two cohorts showed a significant positive correlation.
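The agreement between predictor rankings in two cohorts can be quantified with a rank correlation. The following is a minimal sketch using Spearman's coefficient for tie-free rankings; the abstract does not name the statistic used, and the predictor labels and ranks below are hypothetical placeholders rather than results from the study.

```python
def spearman_from_ranks(rank_a, rank_b):
    """Spearman's rho for two tie-free rankings of the same items.

    rank_a and rank_b map item name -> rank (1 = best).
    Uses rho = 1 - 6 * sum(d^2) / (n * (n^2 - 1)), valid only without ties.
    """
    if set(rank_a) != set(rank_b):
        raise ValueError("both rankings must cover the same items")
    n = len(rank_a)
    if n < 2:
        raise ValueError("need at least two ranked items")
    d_squared = sum((rank_a[name] - rank_b[name]) ** 2 for name in rank_a)
    return 1 - 6 * d_squared / (n * (n ** 2 - 1))


# Hypothetical rankings of four predictors in two cohorts (illustrative only).
cohort_1 = {"p1": 1, "p2": 2, "p3": 3, "p4": 4}
cohort_2 = {"p1": 1, "p2": 3, "p3": 2, "p4": 4}

rho = spearman_from_ranks(cohort_1, cohort_2)
```

A value of rho near 1 indicates that the two cohorts order the predictors similarly; assessing significance would additionally require a permutation test or a reference distribution, which this sketch omits.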

Conclusion: We describe a method to assess computational variant effect predictors that sidesteps the limitations of previous evaluations. This approach is generalizable to future predictors and could continue to inform predictor choice for personal and clinical genetics.
