PMID: 31494793

Validated Inference of Smoking Habits from Blood with a Finite DNA Methylation Marker Set

Abstract

Inferring a person's smoking habit and history from blood is relevant for complementing or replacing self-reports in epidemiological and public health research, and for forensic applications. However, a finite DNA methylation marker set and a validated statistical model based on a large dataset are not yet available. Employing 14 epigenome-wide association studies for marker discovery, and using data from six population-based cohorts (N = 3764) for model building, we identified 13 CpGs most suitable for inferring smoking versus non-smoking status from blood with a cumulative Area Under the Curve (AUC) of 0.901. Internal fivefold cross-validation yielded an average AUC of 0.897 ± 0.137, while external model validation in an independent population-based cohort (N = 1608) achieved an AUC of 0.911. These 13 CpGs also provided accurate inference of current (cross-validation average AUC 0.925 ± 0.021, external validation AUC 0.914), former (0.766 ± 0.023, 0.699) and never smoking (0.830 ± 0.019, 0.781) status, and allowed inference of pack-years in current smokers (10 pack-years 0.800 ± 0.068, 0.796; 15 pack-years 0.767 ± 0.102, 0.752) and of smoking cessation time in former smokers (5 years 0.774 ± 0.024, 0.760; 10 years 0.766 ± 0.033, 0.764; 15 years 0.767 ± 0.020, 0.754). Applying the model to children yielded highly accurate inference of their true non-smoking status (6 years of age: accuracy 0.994, N = 355; 10 years: 0.994, N = 309), suggesting that prenatal and passive smoking exposure have no impact on model applications in adults. The finite set of DNA methylation markers allows accurate inference of smoking habit, with accuracy comparable to that of plasma cotinine, and of smoking history from blood, which we envision becoming useful in epidemiology and public health research, and in medical and forensic applications.
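The evaluation scheme reported above (a classifier on 13 CpG methylation values, scored by AUC and by fivefold cross-validation as mean ± standard deviation) can be illustrated with a minimal sketch. The abstract does not state which model or software the authors used, so everything here is an assumption: the logistic regression fitted by gradient descent, the function names (`auc`, `fit_logistic`, `cross_validated_auc`, `simulate`), and the synthetic data, whose effect sizes are made up and do not come from the study.

```python
import math
import random
import statistics


def auc(scores, labels):
    """AUC as the probability that a random positive (label 1) scores
    higher than a random negative (label 0); ties count as 0.5."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def fit_logistic(X, y, lr=0.5, epochs=150):
    """Logistic regression by batch gradient descent (illustrative only).
    Returns (weights, bias)."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        gw, gb = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            z = b + sum(wj * xj for wj, xj in zip(w, xi))
            err = 1.0 / (1.0 + math.exp(-z)) - yi  # predicted prob - label
            gb += err
            for j in range(d):
                gw[j] += err * xi[j]
        w = [wj - lr * g / n for wj, g in zip(w, gw)]
        b -= lr * gb / n
    return w, b


def cross_validated_auc(X, y, k=5, seed=0):
    """k-fold cross-validation: fit on k-1 folds, compute AUC on the
    held-out fold, and return (mean AUC, sample SD of AUC)."""
    idx = list(range(len(X)))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    aucs = []
    for fold in folds:
        held_out = set(fold)
        train = [i for i in idx if i not in held_out]
        w, b = fit_logistic([X[i] for i in train], [y[i] for i in train])
        # The linear predictor ranks samples identically to the probability.
        scores = [b + sum(wj * xj for wj, xj in zip(w, X[i])) for i in fold]
        aucs.append(auc(scores, [y[i] for i in fold]))
    return statistics.mean(aucs), statistics.stdev(aucs)


def simulate(n=200, n_cpg=13, seed=1):
    """Synthetic beta values in [0, 1]; the first 4 CpGs are hypomethylated
    in smokers. All numbers are invented for illustration."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        smoker = int(rng.random() < 0.3)
        row = []
        for j in range(n_cpg):
            mean = 0.8 - 0.15 * smoker * (j < 4)
            row.append(min(1.0, max(0.0, rng.gauss(mean, 0.08))))
        X.append(row)
        y.append(smoker)
    return X, y


if __name__ == "__main__":
    X, y = simulate()
    mean_auc, sd_auc = cross_validated_auc(X, y, k=5)
    print(f"fivefold CV AUC: {mean_auc:.3f} ± {sd_auc:.3f}")
```

The external validation reported in the abstract corresponds to fitting once on all model-building data and computing `auc` on a cohort that played no part in fitting. A cross-validated standard deviation computed this way reflects variation across folds only, not uncertainty in the external estimate.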

Citing Articles

DNA methylation at AHRR as a master predictor of smoke exposure and a biomarker for sleep and exercise.

Pospiech E, Rudnicka J, Noroozi R, Pisarek-Pacek A, Wysocka B, Masny A. Clin Epigenetics. 2024; 16(1):147.

PMID: 39425209 PMC: 11490037. DOI: 10.1186/s13148-024-01757-0.


Phenotype prediction using biologically interpretable neural networks on multi-cohort multi-omics data.

van Hilten A, van Rooij J, Ikram M, Niessen W, van Meurs J, Roshchupkin G. NPJ Syst Biol Appl. 2024; 10(1):81.

PMID: 39095438 PMC: 11297229. DOI: 10.1038/s41540-024-00405-w.


Uncovering Forensic Evidence: A Path to Age Estimation through DNA Methylation.

Castagnola M, Medina-Paz F, Zapico S. Int J Mol Sci. 2024; 25(9).

PMID: 38732129 PMC: 11084977. DOI: 10.3390/ijms25094917.


Linking Prenatal Environmental Exposures to Lifetime Health with Epigenome-Wide Association Studies: State-of-the-Science Review and Future Recommendations.

Bakulski K, Blostein F, London S. Environ Health Perspect. 2023; 131(12):126001.

PMID: 38048101 PMC: 10695268. DOI: 10.1289/EHP12956.


Epigenetic biomarkers for smoking cessation.

Fang F, Andersen A, Philibert R, Hancock D. Addict Neurosci. 2023; 6.

PMID: 37123087 PMC: 10136056. DOI: 10.1016/j.addicn.2023.100079.

