Criteria for the Validation of Surrogate Endpoints in Randomized Experiments

Overview

Journal Biometrics

Publisher Oxford University Press

Specialty Public Health

Date 1998 Dec 5

PMID 9840970

Citations 96

Authors

M Buyse

G Molenberghs

Abstract

The validation of surrogate endpoints has been studied by Prentice (1989, Statistics in Medicine 8, 431-440) and Freedman, Graubard, and Schatzkin (1992, Statistics in Medicine 11, 167-178). We extend their proposals in the cases where the surrogate and the final endpoints are both binary or normally distributed. Letting T and S be random variables that denote the true and surrogate endpoints, respectively, and Z be an indicator variable for treatment, Prentice's criteria are fulfilled if Z has a significant effect on T and on S, if S has a significant effect on T, and if Z has no effect on T given S. Freedman et al. relaxed the last criterion by estimating PE, the proportion of the effect of Z on T that is explained by S, and by requiring that the lower confidence limit of PE be larger than some proportion, say 0.5 or 0.75. This condition can only be verified if the treatment has a massively significant effect on the true endpoint, a rare situation. We argue that two other quantities must be considered in the validation of a surrogate endpoint: RE, the effect of Z on T relative to that of Z on S, and γ_Z, the association between S and T after adjustment for Z. A surrogate is said to be perfect at the individual level when there is a perfect association between the surrogate and the final endpoint after adjustment for treatment. A surrogate is said to be perfect at the population level if RE is 1. A perfect surrogate fulfills both conditions, in which case S and T are identical up to a deterministic transformation. Fieller's theorem is used for the estimation of PE, RE, and their respective confidence intervals. Logistic regression models and the global odds ratio model studied by Dale (1986, Biometrics 42, 909-917) are used for binary endpoints. Linear models are employed for continuous endpoints. In order to be of practical value, the validation of surrogate endpoints is shown to require large numbers of observations.
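To make the quantities in the abstract concrete, the following is a minimal sketch for the case of normally distributed endpoints. It is not taken from the paper. It assumes three ordinary least squares fits, S on Z (coefficient alpha), T on Z (coefficient beta), and T on Z and S (coefficients beta_S and gamma_Z), and takes PE = 1 - beta_S / beta and RE = beta / alpha. The function names `fieller_ci` and `surrogacy_summary`, the 1.96 critical value, and the use of the residual correlation `rho_Z` as a second summary of adjusted association are all illustrative assumptions. Only the Fieller interval for RE is shown; an interval for PE would also need the covariance between the adjusted and unadjusted treatment effects, which is omitted here.

```python
import numpy as np


def fieller_ci(num, den, var_num, var_den, cov, crit=1.96):
    """Fieller limits for the ratio num/den.

    Returns None when the denominator is not significantly different
    from zero, in which case the confidence set is unbounded.
    """
    a = den**2 - crit**2 * var_den
    b = num * den - crit**2 * cov
    c = num**2 - crit**2 * var_num
    disc = b**2 - a * c
    if a <= 0 or disc < 0:
        return None
    half = np.sqrt(disc)
    return (b - half) / a, (b + half) / a


def surrogacy_summary(z, s, t):
    """Point estimates of PE, RE and adjusted association (hypothetical helper)."""
    z, s, t = (np.asarray(v, dtype=float) for v in (z, s, t))
    n = len(z)

    # Joint fit of S and T on treatment: the columns of B hold
    # (intercept, alpha) for S and (intercept, beta) for T.
    X = np.column_stack([np.ones(n), z])
    Y = np.column_stack([s, t])
    B, *_ = np.linalg.lstsq(X, Y, rcond=None)
    alpha, beta = B[1, 0], B[1, 1]

    # Residual covariance of (S, T) given Z, and the sampling-variance
    # factor shared by both treatment coefficients.
    resid = Y - X @ B
    sigma = resid.T @ resid / (n - 2)
    c = np.linalg.inv(X.T @ X)[1, 1]

    # Fit of T on treatment and surrogate: beta_S and gamma_Z.
    X_adj = np.column_stack([np.ones(n), z, s])
    coef, *_ = np.linalg.lstsq(X_adj, t, rcond=None)
    beta_s, gamma_z = coef[1], coef[2]

    return {
        "PE": float(1.0 - beta_s / beta),
        "RE": float(beta / alpha),
        "RE_ci": fieller_ci(beta, alpha, sigma[1, 1] * c,
                            sigma[0, 0] * c, sigma[0, 1] * c),
        "gamma_Z": float(gamma_z),
        "rho_Z": float(sigma[0, 1] / np.sqrt(sigma[0, 0] * sigma[1, 1])),
    }
```

Calling `surrogacy_summary(z, s, t)` on three equal-length arrays returns a dictionary of estimates. When the effect of Z on S is weak, `RE_ci` comes back as `None`, which mirrors the abstract's point that these quantities can only be estimated usefully with large numbers of observations.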

Citing Articles

Assessing heterogeneity in surrogacy using censored data.

Parast L, Tian L, Cai T. Stat Med. 2024; 43(17):3184-3209.

PMID: 38812276 PMC: 11317910. DOI: 10.1002/sim.10122.

Considerations for using potential surrogate endpoints in cancer screening trials.

Webb A, Berg C, Castle P, Crosby D, Etzioni R, Kessler L. Lancet Oncol. 2024; 25(5):e183-e192.

PMID: 38697164 PMC: 7616115. DOI: 10.1016/S1470-2045(24)00015-9.

Pathway for Development and Validation of Multi-domain Endpoints for Amyloid Light Chain (AL) Amyloidosis.

Signorovitch J, Zhang J, Brown D, Dunnmon P, Xiu L, Done N. Ther Innov Regul Sci. 2024; 58(4):600-609.

PMID: 38632158 PMC: 11169055. DOI: 10.1007/s43441-024-00641-6.

A rank-based approach to evaluate a surrogate marker in a small sample setting.

Parast L, Cai T, Tian L. Biometrics. 2024; 80(1).

PMID: 38386359 PMC: 10883071. DOI: 10.1093/biomtc/ujad035.

Statistical Methods to Evaluate Surrogate Markers.

Parast L, Tian L, Cai T, Palaniappan L. Med Care. 2023; 62(2):102-108.

PMID: 38079232 PMC: 10842261. DOI: 10.1097/MLR.0000000000001956.