» Articles » PMID: 33761318

Machine Learning-based Reclassification of Germline Variants of Unknown Significance: The RENOVO Algorithm

Overview
Journal Am J Hum Genet
Publisher Cell Press
Specialty Genetics
Date 2021 Mar 24
PMID 33761318
Citations 16
Authors
Affiliations
Soon will be listed here.
Abstract

The increasing scope of genetic testing allowed by next-generation sequencing (NGS) dramatically increased the number of genetic variants to be interpreted as pathogenic or benign for adequate patient management. Still, the interpretation process often fails to deliver a clear classification, resulting in either variants of unknown significance (VUSs) or variants with conflicting interpretation of pathogenicity (CIP); these represent a major clinical problem because they do not provide useful information for decision-making, causing a large fraction of genetically determined disease to remain undertreated. We developed a machine learning (random forest)-based tool, RENOVO, that classifies variants as pathogenic or benign on the basis of publicly available information and provides a pathogenicity likelihood score (PLS). Using the same feature classes recommended by guidelines, we trained RENOVO on established pathogenic/benign variants in ClinVar (training set accuracy = 99%) and tested its performance on variants whose interpretation has changed over time (test set accuracy = 95%). We further validated the algorithm on additional datasets including unreported variants validated either through expert consensus (ENIGMA) or laboratory-based functional techniques (on BRCA1/2 and SCN5A). On all datasets, RENOVO outperformed existing automated interpretation tools. On the basis of the above validation metrics, we assigned a defined PLS to all existing ClinVar VUSs, proposing a reclassification for 67% with >90% estimated precision. RENOVO provides a validated tool to reduce the fraction of uninterpreted or misinterpreted variants, tackling an area of unmet need in modern clinical genetics.

Citing Articles

Unlocking precision medicine: clinical applications of integrating health records, genetics, and immunology through artificial intelligence.

Chen Y, Hsiao T, Lin C, Fann Y J Biomed Sci. 2025; 32(1):16.

PMID: 39915780 PMC: 11804102. DOI: 10.1186/s12929-024-01110-w.


An ensemble machine learning-based performance evaluation identifies top In-Silico pathogenicity prediction methods that best classify driver mutations in cancer.

Das S, Patel V, Chakravarty S, Ghosh A, Mukhopadhyay A, Biswas N BioData Min. 2025; 18(1):7.

PMID: 39833905 PMC: 11744934. DOI: 10.1186/s13040-024-00420-x.


Rare disease genomics and precision medicine.

Hong J, Lee D, Hwang A, Kim T, Ryu H, Choi J Genomics Inform. 2024; 22(1):28.

PMID: 39627904 PMC: 11616305. DOI: 10.1186/s44342-024-00032-1.


Identification of Novel Potential Predisposing Variants in Familial Acute Myeloid Leukemia.

Ronchini C, Gigli F, Fontanini M, Furgi R, Amato V, Giglio F Cancer Rep (Hoboken). 2024; 7(8):e2141.

PMID: 39118233 PMC: 11310090. DOI: 10.1002/cnr2.2141.


Accuracy of renovo predictions on variants reclassified over time.

Bonetti E, Tini G, Mazzarella L J Transl Med. 2024; 22(1):713.

PMID: 39085881 PMC: 11293099. DOI: 10.1186/s12967-024-05508-w.


References
1.
Esterling L, Wijayatunge R, Brown K, Morris B, Hughes E, Pruss D . Impact of a Cancer Gene Variant Reclassification Program Over a 20-Year Period. JCO Precis Oncol. 2020; 4. PMC: 7469614. DOI: 10.1200/PO.20.00020. View

2.
Thorsen-Meyer H, Nielsen A, Nielsen A, Kaas-Hansen B, Toft P, Schierbeck J . Dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: a retrospective study of high-frequency data in electronic patient records. Lancet Digit Health. 2020; 2(4):e179-e191. DOI: 10.1016/S2589-7500(20)30018-2. View

3.
Landrum M, Kattman B . ClinVar at five years: Delivering on the promise. Hum Mutat. 2018; 39(11):1623-1630. PMC: 11567647. DOI: 10.1002/humu.23641. View

4.
Findlay G, Boyle E, Hause R, Klein J, Shendure J . Saturation editing of genomic regions by multiplex homology-directed repair. Nature. 2014; 513(7516):120-3. PMC: 4156553. DOI: 10.1038/nature13695. View

5.
Wang K, Li M, Hakonarson H . ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010; 38(16):e164. PMC: 2938201. DOI: 10.1093/nar/gkq603. View