» Articles » PMID: 39572521

Deep Generative AI Models Analyzing Circulating Orphan Non-coding RNAs Enable Detection of Early-stage Lung Cancer

Abstract

Liquid biopsies have the potential to revolutionize cancer care through non-invasive early detection of tumors. Developing a robust liquid biopsy test requires collecting high-dimensional data from a large number of blood samples across heterogeneous groups of patients. We propose that the generative capability of variational auto-encoders enables learning a robust and generalizable signature of blood-based biomarkers. In this study, we analyze orphan non-coding RNAs (oncRNAs) from serum samples of 1050 individuals diagnosed with non-small cell lung cancer (NSCLC) at various stages, as well as sex-, age-, and BMI-matched controls. We demonstrate that our multi-task generative AI model, Orion, surpasses commonly used methods in both overall performance and generalizability to held-out datasets. Orion achieves an overall sensitivity of 94% (95% CI: 87%-98%) at 87% (95% CI: 81%-93%) specificity for cancer detection across all stages, outperforming the sensitivity of other methods on held-out validation datasets by more than  ~ 30%.

Citing Articles

The NcRNA/Wnt axis in lung cancer: oncogenic mechanisms, remarkable indicators and therapeutic targets.

Zhong Y, He J, Huang C, Lai H, Li X, Zheng C J Transl Med. 2025; 23(1):326.

PMID: 40087753 DOI: 10.1186/s12967-025-06326-4.


Deep generative AI models analyzing circulating orphan non-coding RNAs enable detection of early-stage lung cancer.

Karimzadeh M, Momen-Roknabadi A, Cavazos T, Fang Y, Chen N, Multhaup M Nat Commun. 2024; 15(1):10090.

PMID: 39572521 PMC: 11582319. DOI: 10.1038/s41467-024-53851-9.

References
1.
Bonfield J, Marshall J, Danecek P, Li H, Ohan V, Whitwham A . HTSlib: C library for reading/writing high-throughput sequencing data. Gigascience. 2021; 10(2). PMC: 7931820. DOI: 10.1093/gigascience/giab007. View

2.
Fish L, Zhang S, Yu J, Culbertson B, Zhou A, Goga A . Cancer cells exploit an orphan RNA to drive metastatic progression. Nat Med. 2018; 24(11):1743-1751. PMC: 6223318. DOI: 10.1038/s41591-018-0230-4. View

3.
Mazzone P, Bach P, Carey J, Schonewolf C, Bognar K, Ahluwalia M . Clinical Validation of a Cell-Free DNA Fragmentome Assay for Augmentation of Lung Cancer Early Detection. Cancer Discov. 2024; 14(11):2224-2242. PMC: 11528203. DOI: 10.1158/2159-8290.CD-24-0519. View

4.
Lopez R, Regier J, Cole M, Jordan M, Yosef N . Deep generative modeling for single-cell transcriptomics. Nat Methods. 2018; 15(12):1053-1058. PMC: 6289068. DOI: 10.1038/s41592-018-0229-2. View

5.
Phallen J, Sausen M, Adleff V, Leal A, Hruban C, White J . Direct detection of early-stage cancers using circulating tumor DNA. Sci Transl Med. 2017; 9(403). PMC: 6714979. DOI: 10.1126/scitranslmed.aan2415. View