Need for Cross-Validation of Single Particle Cryo-EM
Cross-validation tests the validity of a model on unseen data by assessing whether the model is overfitted to noise. It is widely used in many fields, from artificial intelligence to structural biology, where it is applied in X-ray crystallography and nuclear magnetic resonance. Although there are concerns about map overfitting in cryo-electron microscopy (cryo-EM), cross-validation is rarely used there. The problem is that establishing a performance metric for the maps on unseen data (given by 2D projection images) is difficult because of the low signal-to-noise ratio of the individual particles. Here, I present recent advances in cryo-EM map reconstruction. I highlight that the gold-standard procedure can fail to detect map overfitting in certain cases, which shows the necessity of assessing map quality on unbiased data. Finally, I describe the challenges and advantages of developing a robust cross-validation methodology for cryo-EM.