» Articles » PMID: 37645963

Multi-omic Stratification of the Missense Variant Cysteinome

Overview
Journal bioRxiv
Date 2023 Aug 30
PMID 37645963
Authors
Affiliations
Soon will be listed here.
Abstract

Cancer genomes are rife with genetic variants; one key outcome of this variation is gain-ofcysteine, which is the most frequently acquired amino acid due to missense variants in COSMIC. Acquired cysteines are both driver mutations and sites targeted by precision therapies. However, despite their ubiquity, nearly all acquired cysteines remain uncharacterized. Here, we pair cysteine chemoproteomics-a technique that enables proteome-wide pinpointing of functional, redox sensitive, and potentially druggable residues-with genomics to reveal the hidden landscape of cysteine acquisition. For both cancer and healthy genomes, we find that cysteine acquisition is a ubiquitous consequence of genetic variation that is further elevated in the context of decreased DNA repair. Our chemoproteogenomics platform integrates chemoproteomic, whole exome, and RNA-seq data, with a customized 2-stage false discovery rate (FDR) error controlled proteomic search, further enhanced with a user-friendly FragPipe interface. Integration of CADD predictions of deleteriousness revealed marked enrichment for likely damaging variants that result in acquisition of cysteine. By deploying chemoproteogenomics across eleven cell lines, we identify 116 gain-of-cysteines, of which 10 were liganded by electrophilic druglike molecules. Reference cysteines proximal to missense variants were also found to be pervasive, 791 in total, supporting heretofore untapped opportunities for proteoform-specific chemical probe development campaigns. As chemoproteogenomics is further distinguished by sample-matched combinatorial variant databases and compatible with redox proteomics and small molecule screening, we expect widespread utility in guiding proteoform-specific biology and therapeutic discovery.

References
1.
Liu Y, Beyer A, Aebersold R . On the Dependency of Cellular Protein Levels on mRNA Abundance. Cell. 2016; 165(3):535-50. DOI: 10.1016/j.cell.2016.03.014. View

2.
Veiga Leprevost F, Haynes S, Avtonomov D, Chang H, Shanmugam A, Mellacheruvu D . Philosopher: a versatile toolkit for shotgun proteomics data analysis. Nat Methods. 2020; 17(9):869-870. PMC: 7509848. DOI: 10.1038/s41592-020-0912-y. View

3.
Ghandi M, Huang F, Jane-Valbuena J, Kryukov G, Lo C, McDonald 3rd E . Next-generation characterization of the Cancer Cell Line Encyclopedia. Nature. 2019; 569(7757):503-508. PMC: 6697103. DOI: 10.1038/s41586-019-1186-3. View

4.
Kong A, Leprevost F, Avtonomov D, Mellacheruvu D, Nesvizhskii A . MSFragger: ultrafast and comprehensive peptide identification in mass spectrometry-based proteomics. Nat Methods. 2017; 14(5):513-520. PMC: 5409104. DOI: 10.1038/nmeth.4256. View

5.
Shi Y, Hata A, Lo R, Massague J, Pavletich N . A structural basis for mutational inactivation of the tumour suppressor Smad4. Nature. 1997; 388(6637):87-93. DOI: 10.1038/40431. View