
CidalsDB: an AI-empowered Platform for Anti-pathogen Therapeutics Research

Overview
Journal J Cheminform
Publisher BioMed Central
Specialty Chemistry
Date 2024 Nov 28
PMID 39609715
Abstract

Computer-aided drug discovery (CADD) is benefiting from recent advances in big data analytics and artificial intelligence (AI), which aim to improve drug discovery (DD) outcomes. In this context, reliable datasets are of utmost importance. We herein present CidalsDB, a novel web server for AI-assisted DD against infectious pathogens, namely Leishmania parasites and coronaviruses. We first performed a literature search on molecules with validated anti-pathogen effects and consolidated these data with bioassays from PubChem. We then constructed a database to store these datasets and made them accessible and ready to use for the scientific community through the CidalsDB web-based interface. In a second step, we implemented and optimized four machine learning (ML) and three deep learning (DL) algorithms to predict the biological activity of molecules. Random Forests (RF), Multi-Layer Perceptron (MLP) and ChemBERTa were the best classifiers of anti-Leishmania molecules, while Gradient Boosting (GB), Graph-Convolutional Network (GCN) and ChemBERTa achieved the best performance on the coronaviruses dataset. All six models were optimized and deployed through CidalsDB as anti-pathogen activity prediction models.

Scientific contribution

CidalsDB is an open-access, web-based tool that allows browsing and access to ready-to-use datasets of anti-pathogen molecules, alongside the best-performing AI models for biological activity prediction. It offers a democratized no-code platform for AI-based CADD, which should foster innovation and collaboration within the DD community. CidalsDB is accessible at https://cidalsdb.streamlit.app/.
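The data consolidation step described in the abstract (literature-curated molecules merged with PubChem bioassay records) could look roughly like the sketch below. This is an illustration only, not the authors' pipeline: the function name `consolidate`, the `(smiles, label)` record shape, the "active"/"inactive" labels and the rule of setting aside conflicting molecules are all assumptions made for this example. It also keys molecules on the raw SMILES string, whereas a real workflow would likely canonicalize structures with a cheminformatics toolkit first.

```python
from collections import defaultdict


def consolidate(literature, pubchem):
    """Merge two lists of (smiles, label) records into one labelled dataset.

    Hypothetical helper: molecules that carry conflicting labels across
    the two sources are set aside for review instead of being guessed.
    """
    labels = defaultdict(set)
    for smiles, label in list(literature) + list(pubchem):
        # Assumption: the raw SMILES string identifies the molecule.
        labels[smiles.strip()].add(label)

    # Keep molecules whose sources agree on a single label.
    dataset = {s: next(iter(v)) for s, v in labels.items() if len(v) == 1}
    # Report molecules with more than one distinct label.
    conflicts = sorted(s for s, v in labels.items() if len(v) > 1)
    return dataset, conflicts


# Toy records, invented for illustration (not taken from CidalsDB).
literature = [("CCO", "active"), ("c1ccccc1", "inactive")]
pubchem = [("CCO", "active"), ("c1ccccc1", "active"), ("CC(=O)O", "inactive")]

dataset, conflicts = consolidate(literature, pubchem)
```

Setting conflicting molecules aside rather than picking one label is one possible design choice; it keeps the training set consistent at the cost of discarding some records.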
