Machine Learning-based Reclassification of Germline Variants of Unknown Significance: The RENOVO Algorithm
Overview
Authors
Affiliations
The increasing scope of genetic testing allowed by next-generation sequencing (NGS) dramatically increased the number of genetic variants to be interpreted as pathogenic or benign for adequate patient management. Still, the interpretation process often fails to deliver a clear classification, resulting in either variants of unknown significance (VUSs) or variants with conflicting interpretation of pathogenicity (CIP); these represent a major clinical problem because they do not provide useful information for decision-making, causing a large fraction of genetically determined disease to remain undertreated. We developed a machine learning (random forest)-based tool, RENOVO, that classifies variants as pathogenic or benign on the basis of publicly available information and provides a pathogenicity likelihood score (PLS). Using the same feature classes recommended by guidelines, we trained RENOVO on established pathogenic/benign variants in ClinVar (training set accuracy = 99%) and tested its performance on variants whose interpretation has changed over time (test set accuracy = 95%). We further validated the algorithm on additional datasets including unreported variants validated either through expert consensus (ENIGMA) or laboratory-based functional techniques (on BRCA1/2 and SCN5A). On all datasets, RENOVO outperformed existing automated interpretation tools. On the basis of the above validation metrics, we assigned a defined PLS to all existing ClinVar VUSs, proposing a reclassification for 67% with >90% estimated precision. RENOVO provides a validated tool to reduce the fraction of uninterpreted or misinterpreted variants, tackling an area of unmet need in modern clinical genetics.
Chen Y, Hsiao T, Lin C, Fann Y J Biomed Sci. 2025; 32(1):16.
PMID: 39915780 PMC: 11804102. DOI: 10.1186/s12929-024-01110-w.
Das S, Patel V, Chakravarty S, Ghosh A, Mukhopadhyay A, Biswas N BioData Min. 2025; 18(1):7.
PMID: 39833905 PMC: 11744934. DOI: 10.1186/s13040-024-00420-x.
Rare disease genomics and precision medicine.
Hong J, Lee D, Hwang A, Kim T, Ryu H, Choi J Genomics Inform. 2024; 22(1):28.
PMID: 39627904 PMC: 11616305. DOI: 10.1186/s44342-024-00032-1.
Identification of Novel Potential Predisposing Variants in Familial Acute Myeloid Leukemia.
Ronchini C, Gigli F, Fontanini M, Furgi R, Amato V, Giglio F Cancer Rep (Hoboken). 2024; 7(8):e2141.
PMID: 39118233 PMC: 11310090. DOI: 10.1002/cnr2.2141.
Accuracy of renovo predictions on variants reclassified over time.
Bonetti E, Tini G, Mazzarella L J Transl Med. 2024; 22(1):713.
PMID: 39085881 PMC: 11293099. DOI: 10.1186/s12967-024-05508-w.