» Articles » PMID: 28549425

Fast and Robust Adjustment of Cell Mixtures in Epigenome-wide Association Studies with SmartSVA

Overview
Journal BMC Genomics
Publisher Biomed Central
Specialty Genetics
Date 2017 May 28
PMID 28549425
Citations 43
Authors
Affiliations
Soon will be listed here.
Abstract

Background: One problem that plagues epigenome-wide association studies is the potential confounding due to cell mixtures when purified target cells are not available. Reference-free adjustment of cell mixtures has become increasingly popular due to its flexibility and simplicity. However, existing methods are still not optimal: increased false positive rates and reduced statistical power have been observed in many scenarios.

Methods: We develop SmartSVA, an optimized surrogate variable analysis (SVA) method, for fast and robust reference-free adjustment of cell mixtures. SmartSVA corrects the limitation of traditional SVA under highly confounded scenarios by imposing an explicit convergence criterion and improves the computational efficiency for large datasets.

Results: Compared to traditional SVA, SmartSVA achieves an order-of-magnitude speedup and better false positive control. It protects the signals when capturing the cell mixtures, resulting in significant power increase while controlling for false positives. Through extensive simulations and real data applications, we demonstrate a better performance of SmartSVA than the existing methods.

Conclusions: SmartSVA is a fast and robust method for reference-free adjustment of cell mixtures for epigenome-wide association studies. As a general method, SmartSVA can be applied to other genomic studies to capture unknown sources of variability.

Citing Articles

Epigenomic pathways from racism to preterm birth: secondary analysis of the Nulliparous Pregnancy Outcomes Study: monitoring Mothers-to-be (nuMoM2b) cohort study in the USA to examine how DNA methylation mediates the relationship between multilevel....

Barcelona V, Ray M, Zhao Y, Samari G, Wu H, Reho P BMJ Open. 2025; 15(3):e091801.

PMID: 40037666 PMC: 11881185. DOI: 10.1136/bmjopen-2024-091801.


Attenuated sex-related DNA methylation differences in cancer highlight the magnitude bias mediating existing disparities.

Zhou J, Li M, Chen Y, Wang S, Wang D, Suo C Biol Sex Differ. 2024; 15(1):106.

PMID: 39716176 PMC: 11664931. DOI: 10.1186/s13293-024-00682-4.


Severe traumatic injury is associated with profound changes in DNA methylation.

Eskesen T, Almstrup K, Elgaard L, Arleth T, Lassen M, Creutzburg A NPJ Genom Med. 2024; 9(1):53.

PMID: 39487175 PMC: 11530621. DOI: 10.1038/s41525-024-00438-4.


Parental arsenic exposure and tissue-specific DNA methylation in Bangladeshi infants with spina bifida.

Tindula G, Mukherjee S, Ekramullah S, Arman D, Islam J, Biswas S Epigenetics. 2024; 19(1):2416345.

PMID: 39425535 PMC: 11492674. DOI: 10.1080/15592294.2024.2416345.


Brain cell-type shifts in Alzheimer's disease, autism, and schizophrenia interrogated using methylomics and genetics.

Yap C, Vo D, Heffel M, Bhattacharya A, Wen C, Yang Y Sci Adv. 2024; 10(21):eadn7655.

PMID: 38781333 PMC: 11114225. DOI: 10.1126/sciadv.adn7655.


References
1.
Repsilber D, Kern S, Telaar A, Walzl G, Black G, Selbig J . Biomarker discovery in heterogeneous tissue samples -taking the in-silico deconfounding approach. BMC Bioinformatics. 2010; 11:27. PMC: 3098067. DOI: 10.1186/1471-2105-11-27. View

2.
Reynolds L, Taylor J, Ding J, Lohman K, Johnson C, Siscovick D . Age-related variations in the methylome associated with gene expression in human monocytes and T cells. Nat Commun. 2014; 5:5366. PMC: 4280798. DOI: 10.1038/ncomms6366. View

3.
Teschendorff A, Zhuang J, Widschwendter M . Independent surrogate variable analysis to deconvolve confounding factors in large-scale microarray profiling studies. Bioinformatics. 2011; 27(11):1496-505. DOI: 10.1093/bioinformatics/btr171. View

4.
Wu M, Joubert B, Kuan P, Haberg S, Nystad W, Peddada S . A systematic assessment of normalization approaches for the Infinium 450K methylation platform. Epigenetics. 2013; 9(2):318-29. PMC: 3962542. DOI: 10.4161/epi.27119. View

5.
Gagnon-Bartsch J, Speed T . Using control genes to correct for unwanted variation in microarray data. Biostatistics. 2011; 13(3):539-52. PMC: 3577104. DOI: 10.1093/biostatistics/kxr034. View