» Articles » PMID: 37967567

A Prediction Model for Classifying Maternal Pregnancy Smoking Using California State Birth Certificate Information

Overview
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Systematically recorded smoking data are not always available in vital statistics records, and even when available it can underestimate true smoking rates.

Objective: To develop a prediction model for maternal tobacco smoking in late pregnancy based on birth certificate information using a combination of self- or provider-reported smoking and biomarkers (smoking metabolites) in neonatal blood spots as the alloyed gold standard.

Methods: We designed a case-control study where childhood cancer cases were identified from the California Cancer Registry and controls were from the California birth rolls between 1983 and 2011 who were cancer-free by the age of six. In this analysis, we included 894 control participants and performed high-resolution metabolomics analyses in their neonatal dried blood spots, where we extracted cotinine [mass-to-charge ratio (m/z) = 177.1023] and hydroxycotinine (m/z = 193.0973). Potential predictors of smoking were selected from California birth certificates. Logistic regression with stepwise backward selection was used to build a prediction model. Model performance was evaluated in a training sample, a bootstrapped sample, and an external validation sample.

Results: Out of seven predictor variables entered into the logistic model, five were selected by the stepwise procedure: maternal race/ethnicity, maternal education, child's birth year, parity, and child's birth weight. We calculated an overall discrimination accuracy of 0.72 and an area under the receiver operating characteristic curve (AUC) of 0.81 (95% confidence interval [CI] 0.77, 0.84) in the training set. Similar accuracies were achieved in the internal (AUC 0.81, 95% CI 0.77, 0.84) and external (AUC 0.69, 95% CI 0.64, 0.74) validation sets.

Conclusions: This easy-to-apply model may benefit future birth registry-based studies when there is missing maternal smoking information; however, some smoking status misclassification remains a concern when only variables from the birth certificate are used to predict maternal smoking.

Citing Articles

From multi-omics to predictive biomarker: AI in tumor microenvironment.

Hai L, Jiang Z, Zhang H, Sun Y Front Immunol. 2025; 15:1514977.

PMID: 39763649 PMC: 11701166. DOI: 10.3389/fimmu.2024.1514977.

References
1.
Liu K, Nellis M, Uppal K, Ma C, Tran V, Liang Y . Reference Standardization for Quantification and Harmonization of Large-Scale Metabolomics. Anal Chem. 2020; 92(13):8836-8844. PMC: 7887762. DOI: 10.1021/acs.analchem.0c00338. View

2.
Murphy S, Wickham K, Lindgren B, Spector L, Joseph A . Cotinine and trans 3'-hydroxycotinine in dried blood spots as biomarkers of tobacco exposure and nicotine metabolism. J Expo Sci Environ Epidemiol. 2013; 23(5):513-8. PMC: 4048618. DOI: 10.1038/jes.2013.7. View

3.
Shipe M, Deppen S, Farjah F, Grogan E . Developing prediction models for clinical use using logistic regression: an overview. J Thorac Dis. 2019; 11(Suppl 4):S574-S584. PMC: 6465431. DOI: 10.21037/jtd.2019.01.25. View

4.
He D, Yan Q, Uppal K, Walker D, Jones D, Ritz B . Metabolite Stability in Archived Neonatal Dried Blood Spots Used for Epidemiologic Research. Am J Epidemiol. 2023; 192(10):1720-1730. PMC: 11004922. DOI: 10.1093/aje/kwad122. View

5.
Sasaki S, Braimoh T, Yila T, Yoshioka E, Kishi R . Self-reported tobacco smoke exposure and plasma cotinine levels during pregnancy--a validation study in Northern Japan. Sci Total Environ. 2011; 412-413:114-8. DOI: 10.1016/j.scitotenv.2011.10.019. View