» Articles » PMID: 29398733

TARGETED SEQUENTIAL DESIGN FOR TARGETED LEARNING INFERENCE OF THE OPTIMAL TREATMENT RULE AND ITS MEAN REWARD

Overview
Journal Ann Stat
Specialty Public Health
Date 2018 Feb 6
PMID 29398733
Citations 6
Authors
Affiliations
Soon will be listed here.
Abstract

This article studies the targeted sequential inference of an optimal treatment rule (TR) and its mean reward in the non-exceptional case, , assuming that there is no stratum of the baseline covariates where treatment is neither beneficial nor harmful, and under a companion margin assumption. Our pivotal estimator, whose definition hinges on the targeted minimum loss estimation (TMLE) principle, actually infers the mean reward under the current estimate of the optimal TR. This data-adaptive statistical parameter is worthy of interest on its own. Our main result is a central limit theorem which enables the construction of confidence intervals on both mean rewards under the current estimate of the optimal TR and under the optimal TR itself. The asymptotic variance of the estimator takes the form of the variance of an efficient influence curve at a limiting distribution, allowing to discuss the efficiency of inference. As a by product, we also derive confidence intervals on two cumulated pseudo-regrets, a key notion in the study of bandits problems. A simulation study illustrates the procedure. One of the corner-stones of the theoretical study is a new maximal inequality for martingales with respect to the uniform entropy integral.

Citing Articles

Adaptive sequential surveillance with network and temporal dependence.

Malenica I, Coyle J, van der Laan M, Petersen M Biometrics. 2024; 80(1).

PMID: 38281772 PMC: 10826884. DOI: 10.1093/biomtc/ujad007.


Non-asymptotic Properties of Individualized Treatment Rules from Sequentially Rule-Adaptive Trials.

Gao D, Liu Y, Zeng D J Mach Learn Res. 2023; 23(250).

PMID: 37576335 PMC: 10419117.


Performance Guarantees for Policy Learning.

Luedtke A, Chambaz A Ann I H P Probab Stat. 2022; 56(3):2162-2188.

PMID: 35321441 PMC: 8939837. DOI: 10.1214/19-aihp1034.


Statistical Inference for Online Decision-Making: In a Contextual Bandit Setting.

Chen H, Lu W, Song R J Am Stat Assoc. 2021; 116(533):240-255.

PMID: 33737759 PMC: 7962379. DOI: 10.1080/01621459.2020.1770098.


Personalized public health: An implementation research agenda for the HIV response and beyond.

Geng E, Holmes C, Moshabela M, Sikazwe I, Petersen M PLoS Med. 2020; 16(12):e1003020.

PMID: 31891581 PMC: 6938296. DOI: 10.1371/journal.pmed.1003020.


References
1.
Friedman J, Hastie T, Tibshirani R . Regularization Paths for Generalized Linear Models via Coordinate Descent. J Stat Softw. 2010; 33(1):1-22. PMC: 2929880. View

2.
Zhao Y, Zeng D, Laber E, Kosorok M . New Statistical Learning Methods for Estimating Optimal Dynamic Treatment Regimes. J Am Stat Assoc. 2015; 110(510):583-598. PMC: 4517946. DOI: 10.1080/01621459.2014.937488. View

3.
Rubin D, van der Laan M . Statistical issues and limitations in personalized medicine research with clinical trials. Int J Biostat. 2012; 8(1):18. PMC: 6038934. DOI: 10.1515/1557-4679.1423. View

4.
Chambaz A, van der Laan M . Estimation and testing in targeted goup sequential covariate-adjusted randomized clinical trials. Scand Stat Theory Appl. 2018; 41(1):104-140. PMC: 6084807. DOI: 10.1111/sjos.12013. View

5.
Chakraborty B, Laber E, Zhao Y . Inference about the expected performance of a data-driven dynamic treatment regime. Clin Trials. 2014; 11(4):408-417. PMC: 4265005. DOI: 10.1177/1740774514537727. View