
Model Building Strategy for Logistic Regression: Purposeful Selection

Overview
Journal: Ann Transl Med
Date: 2016 Apr 30
PMID: 27127764
Citations: 205
Abstract

Logistic regression is one of the most commonly used models for adjusting for confounders in the medical literature. This article shows how to carry out the purposeful selection model-building strategy in R. I stress the use of the likelihood ratio test to determine whether deleting a variable has a significant impact on model fit. A deleted variable should also be checked for whether it is an important adjustment for the remaining covariates. Interactions should be checked to disentangle complex relationships between covariates and their synergistic effects on the response variable. The model should then be checked for goodness of fit (GOF), that is, how well the fitted model reflects the observed data. The Hosmer-Lemeshow GOF test is the most widely used for logistic regression models.
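
The three checks named in the abstract can be sketched numerically. The article itself works in R; the block below is only a minimal, standard-library Python illustration, not the author's code. The function names, the example log-likelihoods and coefficients, and the equal-size grouping used for the Hosmer-Lemeshow statistic are assumptions made for this sketch.

```python
import math


def lr_test_1df(loglik_full, loglik_reduced):
    """Likelihood ratio test for deleting ONE parameter (1 degree of freedom)."""
    # G = 2 * (log-likelihood of larger model - log-likelihood without the variable)
    stat = max(2.0 * (loglik_full - loglik_reduced), 0.0)
    # Chi-square survival function with 1 df: P(X > g) = erfc(sqrt(g / 2))
    return stat, math.erfc(math.sqrt(stat / 2.0))


def percent_change(beta_full, beta_reduced):
    """Percent change in a remaining coefficient after a variable is deleted."""
    return 100.0 * (beta_reduced - beta_full) / beta_full


def hosmer_lemeshow(y, p, groups=10):
    """Hosmer-Lemeshow statistic from 0/1 outcomes y and fitted probabilities p.

    Assumes every group is non-empty and has a mean fitted probability
    strictly between 0 and 1. Returns (statistic, degrees of freedom).
    """
    pairs = sorted(zip(p, y))  # order observations by fitted probability
    n = len(pairs)
    stat = 0.0
    for g in range(groups):
        chunk = pairs[g * n // groups:(g + 1) * n // groups]
        n_g = len(chunk)
        expected = sum(p_i for p_i, _ in chunk)  # expected number of events
        observed = sum(y_i for _, y_i in chunk)  # observed number of events
        p_bar = expected / n_g
        stat += (observed - expected) ** 2 / (n_g * p_bar * (1.0 - p_bar))
    return stat, groups - 2  # compared against chi-square with groups - 2 df


# Hypothetical numbers, for illustration only (not taken from the article).
g_stat, p_value = lr_test_1df(loglik_full=-120.0, loglik_reduced=-123.5)
delta = percent_change(beta_full=0.50, beta_reduced=0.62)
print(f"LR statistic = {g_stat:.2f}, p = {p_value:.4f}")
print(f"Change in remaining coefficient = {delta:.1f}%")
```

A small likelihood ratio p-value suggests the deleted variable contributes to model fit. A large percent change in a remaining coefficient suggests the deleted variable is an important adjustment even when its own test is not significant. The cut-off for "large" is a modeling choice that this sketch does not fix.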

Citing Articles

Development of Chronic Kidney Disease Screening Integrative Care Model Led by Community Pharmacists.

Srimongkhol P, Anutrakulchai S, Theeranut A, Methakanjanasak N, Lertsinudom S. Pharmacy (Basel). 2025; 13(1).

PMID: 39998025 PMC: 11858870. DOI: 10.3390/pharmacy13010027.


Association Between Biomass Fuel Use and Depression Symptoms in the Adult Population of Oaxaca, Mexico.

Abeldano Zuniga R, Coca S, Folayan M, Fanta Garrido J, de Lima G. Diseases. 2025; 13(2).

PMID: 39997054 PMC: 11854031. DOI: 10.3390/diseases13020047.


The Determinants of Men's Health Behaviors: A Cross-Sectional Study Among Public Safety Personnel in Kelantan, Malaysia.

Haji Mukhti M, Ibrahim M. Healthcare (Basel). 2025; 13(3).

PMID: 39942480 PMC: 11817108. DOI: 10.3390/healthcare13030291.


The proper application of logistic regression model in complex survey data: a systematic review.

Dey D, Haque M, Islam M, Aishi U, Shammy S, Mayen M. BMC Med Res Methodol. 2025; 25(1):15.

PMID: 39844030 PMC: 11752662. DOI: 10.1186/s12874-024-02454-5.


Water, Sanitation and Hygiene in a Conflict Area: A Cross-Sectional Study in South Kordofan, Sudan.

Asmally R, Imam A, Eissa A, Saeed A, Mohamed A, Abdalla E. J Epidemiol Glob Health. 2025; 15(1):4.

PMID: 39833455 PMC: 11753443. DOI: 10.1007/s44197-025-00347-4.


References
1. Bursac Z, Gauss C, Williams D, Hosmer D. Purposeful selection of variables in logistic regression. Source Code Biol Med. 2008; 3:17. PMC: 2633005. DOI: 10.1186/1751-0473-3-17.

2. Mickey R, Greenland S. The impact of confounder selection criteria on effect estimation. Am J Epidemiol. 1989; 129(1):125-37. DOI: 10.1093/oxfordjournals.aje.a115101.

3. Royston P, Ambler G, Sauerbrei W. The use of fractional polynomials to model continuous risk variables in epidemiology. Int J Epidemiol. 1999; 28(5):964-74. DOI: 10.1093/ije/28.5.964.

4. Greenland S. Modeling and variable selection in epidemiologic analysis. Am J Public Health. 1989; 79(3):340-9. PMC: 1349563. DOI: 10.2105/ajph.79.3.340.

5. Zhang Z, Chen K, Ni H, Fan H. Predictive value of lactate in unselected critically ill patients: an analysis using fractional polynomials. J Thorac Dis. 2014; 6(7):995-1003. PMC: 4120171. DOI: 10.3978/j.issn.2072-1439.2014.07.01.