» Articles » PMID: 33301333

Scaffold-Constrained Molecular Generation

Overview
Date 2020 Dec 10
PMID 33301333
Citations 10
Authors
Affiliations
Soon will be listed here.
Abstract

One of the major applications of generative models for drug discovery targets the lead-optimization phase. During the optimization of a lead series, it is common to have scaffold constraints imposed on the structure of the molecules designed. Without enforcing such constraints, the probability of generating molecules with the required scaffold is extremely low and hinders the practicality of generative models for de novo drug design. To tackle this issue, we introduce a new algorithm, named SAMOA (Scaffold Constrained Molecular Generation), to perform scaffold-constrained in silico molecular design. We build on the well-known SMILES-based Recurrent Neural Network (RNN) generative model, with a modified sampling procedure to achieve scaffold-constrained generation. We directly benefit from the associated reinforcement learning methods, allowing to design molecules optimized for different properties while exploring only the relevant chemical space. We showcase the method's ability to perform scaffold-constrained generation on various tasks: designing novel molecules around scaffolds extracted from SureChEMBL chemical series, generating novel active molecules on the Dopamine Receptor D2 (DRD2) target, and finally, designing predicted actives on the MMP-12 series, an industrial lead-optimization project.

Citing Articles

fragSMILES as a chemical string notation for advanced fragment and chirality representation.

Mastrolorito F, Ciriaco F, Togo M, Gambacorta N, Trisciuzzi D, Altomare C Commun Chem. 2025; 8(1):26.

PMID: 39880917 PMC: 11779804. DOI: 10.1038/s42004-025-01423-3.


A systematic review of deep learning chemical language models in recent era.

Flores-Hernandez H, Martinez-Ledesma E J Cheminform. 2024; 16(1):129.

PMID: 39558376 PMC: 11571686. DOI: 10.1186/s13321-024-00916-y.


PromptSMILES: prompting for scaffold decoration and fragment linking in chemical language models.

Thomas M, Ahmad M, Tresadern G, De Fabritiis G J Cheminform. 2024; 16(1):77.

PMID: 38965600 PMC: 11225391. DOI: 10.1186/s13321-024-00866-5.


Cheminformatics and artificial intelligence for accelerating agrochemical discovery.

Djoumbou-Feunang Y, Wilmot J, Kinney J, Chanda P, Yu P, Sader A Front Chem. 2023; 11:1292027.

PMID: 38093816 PMC: 10716421. DOI: 10.3389/fchem.2023.1292027.


UnCorrupt SMILES: a novel approach to de novo design.

Schoenmaker L, Bequignon O, Jespers W, van Westen G J Cheminform. 2023; 15(1):22.

PMID: 36788579 PMC: 9926805. DOI: 10.1186/s13321-023-00696-x.