Inference and Uncertainty Quantification for Noisy Matrix Completion
Overview
Authors
Affiliations
Noisy matrix completion aims at estimating a low-rank matrix given only partial and corrupted entries. Despite remarkable progress in designing efficient estimation algorithms, it remains largely unclear how to assess the uncertainty of the obtained estimates and how to perform efficient statistical inference on the unknown matrix (e.g., constructing a valid and short confidence interval for an unseen entry). This paper takes a substantial step toward addressing such tasks. We develop a simple procedure to compensate for the bias of the widely used convex and nonconvex estimators. The resulting debiased estimators admit nearly precise nonasymptotic distributional characterizations, which in turn enable optimal construction of confidence intervals/regions for, say, the missing entries and the low-rank factors. Our inferential procedures do not require sample splitting, thus avoiding unnecessary loss of data efficiency. As a byproduct, we obtain a sharp characterization of the estimation accuracy of our debiased estimators in both rate and constant. Our debiased estimators are tractable algorithms that provably achieve full statistical efficiency.
AUGMENTED DOUBLY ROBUST POST-IMPUTATION INFERENCE FOR PROTEOMIC DATA.
Moon H, Du J, Lei J, Roeder K bioRxiv. 2025; .
PMID: 39868108 PMC: 11761724. DOI: 10.1101/2024.03.23.586387.
Using interpretable machine learning to extend heterogeneous antibody-virus datasets.
Einav T, Ma R Cell Rep Methods. 2023; 3(8):100540.
PMID: 37671020 PMC: 10475791. DOI: 10.1016/j.crmeth.2023.100540.
Matrix completion under complex survey sampling.
Mao X, Wang Z, Yang S Ann Inst Stat Math. 2023; 75(3):463-492.
PMID: 37645434 PMC: 10465119. DOI: 10.1007/s10463-022-00851-5.
Chen Y, Fan J, Wang B, Yan Y J Am Stat Assoc. 2023; 118(542):858-868.
PMID: 37313368 PMC: 10259835. DOI: 10.1080/01621459.2021.1956501.
BRIDGING CONVEX AND NONCONVEX OPTIMIZATION IN ROBUST PCA: NOISE, OUTLIERS, AND MISSING DATA.
Chen Y, Fan J, Ma C, Yan Y Ann Stat. 2022; 49(5):2948-2971.
PMID: 36148268 PMC: 9491514. DOI: 10.1214/21-aos2066.