Fast and Robust Adjustment of Cell Mixtures in Epigenome-wide Association Studies with SmartSVA
Overview
Affiliations
Background: One problem that plagues epigenome-wide association studies is the potential confounding due to cell mixtures when purified target cells are not available. Reference-free adjustment of cell mixtures has become increasingly popular due to its flexibility and simplicity. However, existing methods are still not optimal: increased false positive rates and reduced statistical power have been observed in many scenarios.
Methods: We develop SmartSVA, an optimized surrogate variable analysis (SVA) method, for fast and robust reference-free adjustment of cell mixtures. SmartSVA corrects the limitation of traditional SVA under highly confounded scenarios by imposing an explicit convergence criterion and improves the computational efficiency for large datasets.
Results: Compared to traditional SVA, SmartSVA achieves an order-of-magnitude speedup and better false positive control. It protects the signals when capturing the cell mixtures, resulting in significant power increase while controlling for false positives. Through extensive simulations and real data applications, we demonstrate a better performance of SmartSVA than the existing methods.
Conclusions: SmartSVA is a fast and robust method for reference-free adjustment of cell mixtures for epigenome-wide association studies. As a general method, SmartSVA can be applied to other genomic studies to capture unknown sources of variability.
Barcelona V, Ray M, Zhao Y, Samari G, Wu H, Reho P BMJ Open. 2025; 15(3):e091801.
PMID: 40037666 PMC: 11881185. DOI: 10.1136/bmjopen-2024-091801.
Zhou J, Li M, Chen Y, Wang S, Wang D, Suo C Biol Sex Differ. 2024; 15(1):106.
PMID: 39716176 PMC: 11664931. DOI: 10.1186/s13293-024-00682-4.
Severe traumatic injury is associated with profound changes in DNA methylation.
Eskesen T, Almstrup K, Elgaard L, Arleth T, Lassen M, Creutzburg A NPJ Genom Med. 2024; 9(1):53.
PMID: 39487175 PMC: 11530621. DOI: 10.1038/s41525-024-00438-4.
Tindula G, Mukherjee S, Ekramullah S, Arman D, Islam J, Biswas S Epigenetics. 2024; 19(1):2416345.
PMID: 39425535 PMC: 11492674. DOI: 10.1080/15592294.2024.2416345.
Yap C, Vo D, Heffel M, Bhattacharya A, Wen C, Yang Y Sci Adv. 2024; 10(21):eadn7655.
PMID: 38781333 PMC: 11114225. DOI: 10.1126/sciadv.adn7655.