» Articles » PMID: 37131872

CancerGPT: Few-shot Drug Pair Synergy Prediction Using Large Pre-trained Language Models

Overview
Journal ArXiv
Date 2023 May 3
PMID 37131872
Authors
Affiliations
Soon will be listed here.
Abstract

Large pre-trained language models (LLMs) have been shown to have significant potential in few-shot learning across various fields, even with minimal training data. However, their ability to generalize to unseen tasks in more complex fields, such as biology, has yet to be fully evaluated. LLMs can offer a promising alternative approach for biological inference, particularly in cases where structured data and sample size are limited, by extracting prior knowledge from text corpora. Our proposed few-shot learning approach uses LLMs to predict the synergy of drug pairs in rare tissues that lack structured data and features. Our experiments, which involved seven rare tissues from different cancer types, demonstrated that the LLM-based prediction model achieved significant accuracy with very few or zero samples. Our proposed model, the CancerGPT (with ~ 124M parameters), was even comparable to the larger fine-tuned GPT-3 model (with ~ 175B parameters). Our research is the first to tackle drug pair synergy prediction in rare tissues with limited data. We are also the first to utilize an LLM-based prediction model for biological reaction prediction tasks.

Citing Articles

Computational Approaches: A New Frontier in Cancer Research.

Srivastava S, Jain P Comb Chem High Throughput Screen. 2023; 27(13):1861-1876.

PMID: 38031782 DOI: 10.2174/0113862073265604231106112203.

References
1.
Jones R, Vuky J, Elliott T, Mead G, Arranz J, Chester J . Phase II study to assess the efficacy, safety and tolerability of the mitotic spindle kinesin inhibitor AZD4877 in patients with recurrent advanced urothelial cancer. Invest New Drugs. 2013; 31(4):1001-7. DOI: 10.1007/s10637-013-9926-y. View

2.
Madani A, Krause B, Greene E, Subramanian S, Mohr B, Holton J . Large language models generate functional protein sequences across diverse families. Nat Biotechnol. 2023; 41(8):1099-1106. PMC: 10400306. DOI: 10.1038/s41587-022-01618-2. View

3.
Zheng S, Aldahdooh J, Shadbahr T, Wang Y, Aldahdooh D, Bao J . DrugComb update: a more comprehensive drug sensitivity data repository and analysis portal. Nucleic Acids Res. 2021; 49(W1):W174-W184. PMC: 8218202. DOI: 10.1093/nar/gkab438. View

4.
Lin X, Patil S, Gao Y, Qian A . The Bone Extracellular Matrix in Bone Formation and Regeneration. Front Pharmacol. 2020; 11:757. PMC: 7264100. DOI: 10.3389/fphar.2020.00757. View

5.
Yadav B, Wennerberg K, Aittokallio T, Tang J . Searching for Drug Synergy in Complex Dose-Response Landscapes Using an Interaction Potency Model. Comput Struct Biotechnol J. 2016; 13:504-13. PMC: 4759128. DOI: 10.1016/j.csbj.2015.09.001. View