High-dimensional Propensity Score Adjustment in Studies of Treatment Effects Using Health Care Claims Data
Background: Adjusting for large numbers of covariates ascertained from patients' health care claims data may improve control of confounding, as these variables may collectively be proxies for unobserved factors. Here, we develop and test an algorithm that empirically identifies candidate covariates, prioritizes them, and integrates them into a propensity-score-based confounder adjustment model.
Methods: We developed a multistep algorithm to implement high-dimensional proxy adjustment in claims data. Steps include (1) identifying data dimensions, eg, diagnoses, procedures, and medications; (2) empirically identifying candidate covariates; (3) assessing recurrence of codes; (4) prioritizing covariates; (5) selecting covariates for adjustment; (6) estimating the exposure propensity score; and (7) estimating an outcome model. This algorithm was tested in Medicare claims data, including a study of whether Cox-2 inhibitors reduce gastric toxicity compared with nonselective nonsteroidal anti-inflammatory drugs (NSAIDs).
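Steps 2 through 5 above can be sketched in a few lines of Python. This is a minimal illustration on in-memory toy structures, not the authors' implementation. The function names, the (patient, dimension, code) claim tuples, the "once"/"frequent" recurrence cut-offs, and the use of a Bross-type bias multiplier for ranking are all assumptions made for the example; the abstract does not state the ranking rule or the recurrence thresholds. Steps 6 and 7 (fitting the propensity score and outcome models) are left out.

```python
import math
import statistics
from collections import Counter, defaultdict


def candidate_codes(claims, top_n):
    """Step 2 (assumed rule): keep the top_n codes per data dimension,
    ranked by how many distinct patients carry the code."""
    patients = defaultdict(set)
    for pid, dim, code in claims:
        patients[(dim, code)].add(pid)
    by_dim = defaultdict(list)
    for (dim, code), pids in patients.items():
        by_dim[dim].append((-len(pids), code))
    keep = set()
    for dim, ranked in by_dim.items():
        for _, code in sorted(ranked)[:top_n]:
            keep.add((dim, code))
    return keep


def recurrence_covariates(claims, keep):
    """Step 3 (assumed cut-offs): turn each kept code into binary covariates,
    'once' (code seen at least once) and 'frequent' (seen more than once and
    at least the median count among patients who have the code)."""
    counts = Counter(c for c in claims if (c[1], c[2]) in keep)
    per_code = defaultdict(dict)
    for (pid, dim, code), n in counts.items():
        per_code[(dim, code)][pid] = n
    covariates = {}
    for (dim, code), by_pid in per_code.items():
        covariates[(dim, code, "once")] = set(by_pid)
        median = statistics.median(by_pid.values())
        frequent = {pid for pid, n in by_pid.items() if n > 1 and n >= median}
        if frequent:
            covariates[(dim, code, "frequent")] = frequent
    return covariates


def bross_log_bias(with_cov, exposed, cases, everyone):
    """Step 4 (assumed ranking rule): |log| of a Bross-type bias multiplier,
    (p_c1*(rr_cd-1)+1) / (p_c0*(rr_cd-1)+1). Returns 0.0 when undefined."""
    unexposed = everyone - exposed
    without = everyone - with_cov
    if not (exposed and unexposed and with_cov and without):
        return 0.0
    p_c1 = len(with_cov & exposed) / len(exposed)      # prevalence, exposed
    p_c0 = len(with_cov & unexposed) / len(unexposed)  # prevalence, unexposed
    risk1 = len(with_cov & cases) / len(with_cov)
    risk0 = len(without & cases) / len(without)
    if risk1 == 0 or risk0 == 0:
        return 0.0
    rr_cd = risk1 / risk0  # covariate-outcome relative risk
    bias = (p_c1 * (rr_cd - 1) + 1) / (p_c0 * (rr_cd - 1) + 1)
    return abs(math.log(bias))


def select_covariates(covariates, exposed, cases, everyone, k):
    """Step 5: keep the k covariates with the largest assumed bias score."""
    scored = [(bross_log_bias(pids, exposed, cases, everyone), name)
              for name, pids in covariates.items()]
    scored.sort(key=lambda t: (-t[0], t[1]))
    return [name for _, name in scored[:k]]
```

In a real analysis the selected indicators would be added to the predefined covariates in a propensity score model such as a logistic regression of exposure on covariates. The set-based representation here is chosen only to keep the sketch dependency-free.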
Results: In a population of 49,653 new users of Cox-2 inhibitors or nonselective NSAIDs, the crude relative risk (RR) for upper gastrointestinal (GI) toxicity was 1.09 (95% confidence interval = 0.91-1.30). Adjusting for 15 predefined covariates resulted in a possible gastroprotective effect (RR = 0.94 [0.78-1.12]). The gastroprotective effect became stronger after adjusting for an additional 500 algorithm-derived covariates (RR = 0.88 [0.73-1.06]). Results of a study on the effect of statins on mortality were similar. Adjustment using the algorithm confirmed a null association between influenza vaccination and hip fracture (RR = 1.02 [0.85-1.21]).
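To compare the precision of the reported estimates, the standard error of log(RR) can be backed out from each confidence interval. The sketch below assumes the intervals are symmetric Wald intervals on the log scale, which the abstract does not state. The helper name `se_from_ci` is hypothetical, and the input values are the rounded figures reported above.

```python
import math


def se_from_ci(lower, upper, z=1.959964):
    """Implied standard error of log(RR), assuming a Wald interval
    of the form exp(log(RR) +/- z * SE)."""
    return (math.log(upper) - math.log(lower)) / (2 * z)


# Reported estimates: (label, RR, lower bound, upper bound)
reported = [
    ("crude", 1.09, 0.91, 1.30),
    ("15 predefined covariates", 0.94, 0.78, 1.12),
    ("plus 500 algorithm-derived covariates", 0.88, 0.73, 1.06),
]

for label, rr, lo, hi in reported:
    print(f"{label}: RR={rr}, implied SE(log RR)={se_from_ci(lo, hi):.3f}")
```

Under that assumption, similar implied standard errors across the three rows would suggest that the additional covariates shifted the point estimate without a large loss of precision.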
Conclusions: In typical pharmacoepidemiologic studies, the proposed high-dimensional propensity score resulted in improved effect estimates compared with adjustment limited to predefined covariates, when benchmarked against results expected from randomized trials.