» Articles » PMID: 37468968

PINNED: Identifying Characteristics of Druggable Human Proteins Using an Interpretable Neural Network

Overview
Journal J Cheminform
Publisher Biomed Central
Specialty Chemistry
Date 2023 Jul 19
PMID 37468968
Authors
Affiliations
Soon will be listed here.
Abstract

The identification of human proteins that are amenable to pharmacologic modulation without significant off-target effects remains an important unsolved challenge. Computational methods have been devised to identify features which distinguish between "druggable" and "undruggable" proteins, finding that protein sequence, tissue and cellular localization, biological role, and position in the protein-protein interaction network are all important discriminant factors. However, many prior efforts to automate the assessment of protein druggability suffer from low performance or poor interpretability. We developed a neural network-based machine learning model capable of generating druggability sub-scores based on each of four distinct categories, combining them to form an overall druggability score. The model achieves an excellent performance in separating drugged and undrugged proteins in the human proteome, with an area under the receiver operating characteristic (AUC) of 0.95. Our use of multiple sub-scores allows the assessment of potential protein targets of interest based on distinct contributors to druggability, leading to a more interpretable and holistic model to identify novel targets.

Citing Articles

Research on Bitter Peptides in the Field of Bioinformatics: A Comprehensive Review.

Liu S, Shi T, Yu J, Li R, Lin H, Deng K Int J Mol Sci. 2024; 25(18).

PMID: 39337334 PMC: 11432553. DOI: 10.3390/ijms25189844.


Unraveling druggable cancer-driving proteins and targeted drugs using artificial intelligence and multi-omics analyses.

Lopez-Cortes A, Cabrera-Andrade A, Echeverria-Garces G, Echeverria-Espinoza P, Pineda-Alban M, Elsitdie N Sci Rep. 2024; 14(1):19359.

PMID: 39169044 PMC: 11339426. DOI: 10.1038/s41598-024-68565-7.


Comprehensive Research on Druggable Proteins: From PSSM to Pre-Trained Language Models.

Chu H, Liu T Int J Mol Sci. 2024; 25(8).

PMID: 38674091 PMC: 11049818. DOI: 10.3390/ijms25084507.


BATMAN-TCM 2.0: an enhanced integrative database for known and predicted interactions between traditional Chinese medicine ingredients and target proteins.

Kong X, Liu C, Zhang Z, Cheng M, Mei Z, Li X Nucleic Acids Res. 2023; 52(D1):D1110-D1120.

PMID: 37904598 PMC: 10767940. DOI: 10.1093/nar/gkad926.

References
1.
Raies A, Tulodziecka E, Stainer J, Middleton L, Dhindsa R, Hill P . DrugnomeAI is an ensemble machine-learning framework for predicting druggability of candidate drug targets. Commun Biol. 2022; 5(1):1291. PMC: 9700683. DOI: 10.1038/s42003-022-04245-4. View

2.
Uhlen M, Fagerberg L, Hallstrom B, Lindskog C, Oksvold P, Mardinoglu A . Proteomics. Tissue-based map of the human proteome. Science. 2015; 347(6220):1260419. DOI: 10.1126/science.1260419. View

3.
Wouters O, Mckee M, Luyten J . Estimated Research and Development Investment Needed to Bring a New Medicine to Market, 2009-2018. JAMA. 2020; 323(9):844-853. PMC: 7054832. DOI: 10.1001/jama.2020.1166. View

4.
Yu C, Lin C, Hwang J . Predicting subcellular localization of proteins for Gram-negative bacteria by support vector machines based on n-peptide compositions. Protein Sci. 2004; 13(5):1402-6. PMC: 2286765. DOI: 10.1110/ps.03479604. View

5.
Ashburner M, Ball C, Blake J, Botstein D, Butler H, Cherry J . Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000; 25(1):25-9. PMC: 3037419. DOI: 10.1038/75556. View