Deep Learning Empowers the Discovery of Self-Assembling Peptides with Over 10 Trillion Sequences
Overview
Affiliations
Self-assembling of peptides is essential for a variety of biological and medical applications. However, it is challenging to investigate the self-assembling properties of peptides within the complete sequence space due to the enormous sequence quantities. Here, it is demonstrated that a transformer-based deep learning model is effective in predicting the aggregation propensity (AP) of peptide systems, even for decapeptide and mixed-pentapeptide systems with over 10 trillion sequence quantities. Based on the predicted AP values, not only the aggregation laws for designing self-assembling peptides are derived, but the transferability relation among the APs of pentapeptides, decapeptides, and mixed pentapeptides is also revealed, leading to discoveries of self-assembling peptides by concatenating or mixing, as consolidated by experiments. This deep learning approach enables speedy, accurate, and thorough search and design of self-assembling peptides within the complete sequence space of oligopeptides, advancing peptide science by inspiring new biological and medical applications.
Dhoriyani J, Bergman M, Hall C, You F PNAS Nexus. 2025; 4(1):pgae572.
PMID: 39871828 PMC: 11770337. DOI: 10.1093/pnasnexus/pgae572.
Challenges and opportunities of poly(amino acid) nanomedicines in cancer therapy.
Li Y, He J, Liu J, Um W, Ding J Nanomedicine (Lond). 2024; 19(29):2495-2504.
PMID: 39381990 PMC: 11520535. DOI: 10.1080/17435889.2024.2402677.
Aggregation Rules of Short Peptides.
Wang J, Liu Z, Zhao S, Zhang Y, Xu T, Li S JACS Au. 2024; 4(9):3567-3580.
PMID: 39328768 PMC: 11423302. DOI: 10.1021/jacsau.4c00501.
Ren X, Wei J, Luo X, Liu Y, Li K, Zhang Q Adv Sci (Weinh). 2024; 11(26):e2400829.
PMID: 38704695 PMC: 11234452. DOI: 10.1002/advs.202400829.
Deep Learning Empowers the Discovery of Self-Assembling Peptides with Over 10 Trillion Sequences.
Wang J, Liu Z, Zhao S, Xu T, Wang H, Li S Adv Sci (Weinh). 2023; 10(31):e2301544.
PMID: 37749875 PMC: 10625107. DOI: 10.1002/advs.202301544.