» Articles » PMID: 30652356

Using Simulation Studies to Evaluate Statistical Methods

Overview
Journal Stat Med
Publisher Wiley
Specialty Public Health
Date 2019 Jan 18
PMID 30652356
Citations 339
Authors
Affiliations
Soon will be listed here.
Abstract

Simulation studies are computer experiments that involve creating data by pseudo-random sampling. A key strength of simulation studies is the ability to understand the behavior of statistical methods because some "truth" (usually some parameter/s of interest) is known from the process of generating the data. This allows us to consider properties of methods, such as bias. While widely used, simulation studies are often poorly designed, analyzed, and reported. This tutorial outlines the rationale for using simulation studies and offers guidance for design, execution, analysis, reporting, and presentation. In particular, this tutorial provides a structured approach for planning and reporting simulation studies, which involves defining aims, data-generating mechanisms, estimands, methods, and performance measures ("ADEMP"); coherent terminology for simulation studies; guidance on coding simulation studies; a critical discussion of key performance measures and their estimation; guidance on structuring tabular and graphical presentation of results; and new graphical presentations. With a view to describing recent practice, we review 100 articles taken from Volume 34 of Statistics in Medicine, which included at least one simulation study and identify areas for improvement.

Citing Articles

Improving genetic variant identification for quantitative traits using ensemble learning-based approaches.

Sharma J, Jangale V, Shekhawat R, Yadav P BMC Genomics. 2025; 26(1):237.

PMID: 40075256 PMC: 11899862. DOI: 10.1186/s12864-025-11443-x.


A Spline-Based Approach to Smoothly Constrain Hazard Ratios With a View to Apply Treatment Effect Waning.

Jennings A, Rutherford M, Lambert P Stat Med. 2025; 44(6):e70035.

PMID: 40059376 PMC: 11891414. DOI: 10.1002/sim.70035.


Wavelet-Mixed Landmark Survival Models for the Effect of Short-Term Changes of Potassium in Heart Failure Patients.

Gregorio C, Barbati G, Scagnetto A, Lenarda A, Ieva F Biom J. 2025; 67(2):e70043.

PMID: 40047178 PMC: 11883744. DOI: 10.1002/bimj.70043.


A Seamless Design for the Combination of a Case-Control and a Cohort Diagnostic Accuracy Study.

Bibiza-Freiwald E, Vach W, Zapf A Stat Med. 2025; 44(6):e70016.

PMID: 40042437 PMC: 11881794. DOI: 10.1002/sim.70016.


Analyzing Coarsened and Missing Data by Imputation Methods.

van der Burg L, Bohringer S, Bartlett J, Bosse T, Horeweg N, de Wreede L Stat Med. 2025; 44(6):e70032.

PMID: 40042406 PMC: 11881681. DOI: 10.1002/sim.70032.


References
1.
Kimani P, Todd S, Stallard N . Estimation after subpopulation selection in adaptive seamless trials. Stat Med. 2015; 34(18):2581-601. PMC: 4973856. DOI: 10.1002/sim.6506. View

2.
Thompson J, Fielding K, Davey C, Aiken A, Hargreaves J, Hayes R . Bias and inference from misspecified mixed-effect models in stepped wedge trial analysis. Stat Med. 2017; 36(23):3670-3682. PMC: 5600088. DOI: 10.1002/sim.7348. View

3.
van Smeden M, de Groot J, Moons K, Collins G, Altman D, Eijkemans M . No rationale for 1 variable per 10 events criterion for binary logistic regression analysis. BMC Med Res Methodol. 2016; 16(1):163. PMC: 5122171. DOI: 10.1186/s12874-016-0267-3. View

4.
Chaurasia A, Harel O . Partial F-tests with multiply imputed data in the linear regression framework via coefficient of determination. Stat Med. 2014; 34(3):432-43. DOI: 10.1002/sim.6334. View

5.
Lambert P, Dickman P, Rutherford M . Comparison of different approaches to estimating age standardized net survival. BMC Med Res Methodol. 2015; 15:64. PMC: 4537569. DOI: 10.1186/s12874-015-0057-3. View