ASSESSING SELECTION BIAS IN REGRESSION COEFFICIENTS ESTIMATED FROM NONPROBABILITY SAMPLES WITH APPLICATIONS TO GENETICS AND DEMOGRAPHIC SURVEYS

Overview
Journal Ann Appl Stat
Date 2022 Mar 3
PMID 35237377
Authors West B, Little R, Andridge R, Boonstra P, Ware E, Pandit A
Abstract

Selection bias is a serious potential problem for inference about relationships of scientific interest based on samples without well-defined probability sampling mechanisms. Motivated by the potential for selection bias in: (a) estimated relationships of polygenic scores (PGSs) with phenotypes in genetic studies of volunteers and (b) estimated differences in subgroup means in surveys of smartphone users, we derive novel measures of selection bias for estimates of the coefficients in linear and probit regression models fitted to nonprobability samples, when aggregate-level auxiliary data are available for the selected sample and the target population. The measures arise from normal pattern-mixture models that allow analysts to examine the sensitivity of their inferences to assumptions about nonignorable selection in these samples. We examine the effectiveness of the proposed measures in a simulation study and then use them to quantify the selection bias in: (a) estimated PGS-phenotype relationships in a large study of volunteers recruited via Facebook and (b) estimated subgroup differences in mean past-year employment duration in a nonprobability sample of low-educated smartphone users. We evaluate the performance of the measures in these applications using benchmark estimates from large probability samples.
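
To make the problem concrete, the following minimal Python sketch simulates nonignorable selection and shows how it can distort an estimated regression coefficient. It does not implement the authors' pattern-mixture measures. It is only an illustration under assumed settings: a synthetic population with a true slope of 0.5, and a hypothetical selection rule in which units with larger outcomes are more likely to volunteer. All names and parameter values are invented for this example.

```python
import random


def ols_slope(xs, ys):
    """Least-squares slope of y on x for a simple linear regression."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx


def simulate(n_pop=20000, beta=0.5, seed=1):
    """Return (population slope, selected-sample slope, their difference).

    Assumed data-generating process (illustrative only):
      x ~ N(0, 1),  y = 1 + beta * x + N(0, 1) noise.
    Assumed selection rule (hypothetical): a unit enters the nonprobability
    sample when y plus independent N(0, 1) noise exceeds 1, so selection
    depends on the outcome itself, i.e. it is nonignorable.
    """
    rng = random.Random(seed)
    xs = [rng.gauss(0, 1) for _ in range(n_pop)]
    ys = [1.0 + beta * x + rng.gauss(0, 1) for x in xs]

    selected = [(x, y) for x, y in zip(xs, ys) if y + rng.gauss(0, 1) > 1.0]
    sel_x, sel_y = zip(*selected)

    pop_slope = ols_slope(xs, ys)
    sel_slope = ols_slope(sel_x, sel_y)
    return pop_slope, sel_slope, sel_slope - pop_slope


if __name__ == "__main__":
    pop_slope, sel_slope, bias = simulate()
    print(f"population slope:      {pop_slope:.3f}")
    print(f"selected-sample slope: {sel_slope:.3f}")
    print(f"selection bias:        {bias:.3f}")
```

In this toy setting the selected-sample slope is attenuated toward zero relative to the population slope. In practice the population slope is unknown, which is why measures based on aggregate auxiliary data, such as those proposed in the article, are needed.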

Citing Articles

Analyzing Potential Non-Ignorable Selection Bias in an Off-Wave Mail Survey Implemented in a Long-Standing Panel Study.

Schroeder H, West B J Surv Stat Methodol. 2025; 13(1):100-127.

PMID: 39877150 PMC: 11770253. DOI: 10.1093/jssam/smae039.


Risk of Traumatic Intracranial Hemorrhage After Stroke: A Nationwide Population-Based Cohort Study in Taiwan.

Fang Y, Liao S, Chen P, Yeh T, Chen C, Piravej K J Am Heart Assoc. 2024; 13(19):e035725.

PMID: 39291491 PMC: 11681476. DOI: 10.1161/JAHA.124.035725.


Evaluating Pre-election Polling Estimates Using a New Measure of Non-ignorable Selection Bias.

West B, Andridge R Public Opin Q. 2023; 87(Suppl 1):575-601.

PMID: 37705923 PMC: 10496568. DOI: 10.1093/poq/nfad018.


ASSESSING SELECTION BIAS IN REGRESSION COEFFICIENTS ESTIMATED FROM NONPROBABILITY SAMPLES WITH APPLICATIONS TO GENETICS AND DEMOGRAPHIC SURVEYS.

West B, Little R, Andridge R, Boonstra P, Ware E, Pandit A Ann Appl Stat. 2022; 15(3):1556-1581.

PMID: 35237377 PMC: 8887878. DOI: 10.1214/21-aoas1453.
