Biological Representation of Chemicals Using Latent Target Interaction Profile
Overview
Authors
Affiliations
Background: Computational prediction of a phenotypic response upon the chemical perturbation on a biological system plays an important role in drug discovery, and many other applications. Chemical fingerprints are a widely used feature to build machine learning models. However, the fingerprints that are derived from chemical structures ignore the biological context, thus, they suffer from several problems such as the activity cliff and curse of dimensionality. Fundamentally, the chemical modulation of biological activities is a multi-scale process. It is the genome-wide chemical-target interactions that modulate chemical phenotypic responses. Thus, the genome-scale chemical-target interaction profile will more directly correlate with in vitro and in vivo activities than the chemical structure. Nevertheless, the scope of direct application of the chemical-target interaction profile is limited due to the severe incompleteness, biasness, and noisiness of bioassay data.
Results: To address the aforementioned problems, we developed a novel chemical representation method: Latent Target Interaction Profile (LTIP). LTIP embeds chemicals into a low dimensional continuous latent space that represents genome-scale chemical-target interactions. Subsequently LTIP can be used as a feature to build machine learning models. Using the drug sensitivity of cancer cell lines as a benchmark, we have shown that the LTIP robustly outperforms chemical fingerprints regardless of machine learning algorithms. Moreover, the LTIP is complementary with the chemical fingerprints. It is possible for us to combine LTIP with other fingerprints to further improve the performance of bioactivity prediction.
Conclusions: Our results demonstrate the potential of LTIP in particular and multi-scale modeling in general in predictive modeling of chemical modulation of biological activities.
COVID-19 Multi-Targeted Drug Repurposing Using Few-Shot Learning.
Liu Y, Wu Y, Shen X, Xie L Front Bioinform. 2022; 1:693177.
PMID: 36303751 PMC: 9581066. DOI: 10.3389/fbinf.2021.693177.
A review on machine learning approaches and trends in drug discovery.
Carracedo-Reboredo P, Linares-Blanco J, Rodriguez-Fernandez N, Cedron F, Novoa F, Carballal A Comput Struct Biotechnol J. 2021; 19:4538-4558.
PMID: 34471498 PMC: 8387781. DOI: 10.1016/j.csbj.2021.08.011.
Pham T, Qiu Y, Zeng J, Xie L, Zhang P Nat Mach Intell. 2021; 3(3):247-257.
PMID: 33796820 PMC: 8009091. DOI: 10.1038/s42256-020-00285-9.
Liu Q, Xie L PLoS Comput Biol. 2021; 17(2):e1008653.
PMID: 33577560 PMC: 7906476. DOI: 10.1371/journal.pcbi.1008653.
A deep learning framework for high-throughput mechanism-driven phenotype compound screening.
Pham T, Qiu Y, Zeng J, Xie L, Zhang P bioRxiv. 2020; .
PMID: 32743586 PMC: 7386506. DOI: 10.1101/2020.07.19.211235.