» Articles » PMID: 34812602

UV-adVISor: Attention-Based Recurrent Neural Networks to Predict UV-Vis Spectra

Overview
Journal Anal Chem
Specialty Chemistry
Date 2021 Nov 23
PMID 34812602
Citations 8
Authors
Affiliations
Soon will be listed here.
Abstract

Ultraviolet-visible (UV-Vis) absorption spectra are routinely collected as part of high-performance liquid chromatography (HPLC) analysis systems and can be used to identify chemical reaction products by comparison to the reference spectra. Here, we present UV-adVISor as a new computational tool for predicting the UV-Vis spectra from a molecule's structure alone. UV-Vis prediction was approached as a sequence-to-sequence problem. We utilized Long-Short Term Memory and attention-based neural networks with Extended Connectivity Fingerprint Diameter 6 or molecule SMILES to generate predictive models for the UV spectra. We have produced two spectrum datasets (dataset I, = 949, and dataset II, = 2222) using different compound collections and spectrum acquisition methods to train, validate, and test our models. We evaluated the prediction accuracy of the complete spectra by the correspondence of wavelengths of absorbance maxima and with a series of statistical measures (the best test set median model parameters are in parentheses for model II), including RMSE (0.064), (0.71), and dynamic time warping (DTW, 0.194) of the entire spectrum curve. Scrambling molecule structures with the experimental spectra during training resulted in a degraded , confirming the utility of the approaches for prediction. UV-adVISor is able to provide fast and accurate predictions for libraries of compounds.

Citing Articles

Natural Products Dereplication: Databases and Analytical Methods.

Perez-Victoria I Prog Chem Org Nat Prod. 2024; 124:1-56.

PMID: 39101983 DOI: 10.1007/978-3-031-59567-7_1.


Predicting the Hallucinogenic Potential of Molecules Using Artificial Intelligence.

Urbina F, Jones T, Harris J, Snyder S, Lane T, Ekins S ACS Chem Neurosci. 2024; 15(16):3078-3089.

PMID: 39092989 PMC: 11338697. DOI: 10.1021/acschemneuro.4c00405.


The Goldilocks paradigm: comparing classical machine learning, large language models, and few-shot learning for drug discovery applications.

Snyder S, Vignaux P, Ozalp M, Gerlach J, Puhl A, Lane T Commun Chem. 2024; 7(1):134.

PMID: 38866916 PMC: 11169557. DOI: 10.1038/s42004-024-01220-4.


Interactions of ferulic acid and ferulic acid methyl ester with endogenous proteins: Determination using the multi-methods.

Yang Y, Wang S, Liu X, Zhang W, Tong W, Luo H Heliyon. 2024; 10(2):e24605.

PMID: 38312678 PMC: 10835327. DOI: 10.1016/j.heliyon.2024.e24605.


Machine Learning Spectroscopy Using a 2-Stage, Generalized Constituent Contribution Protocol.

Fan J, Qian C, Zhou S Research (Wash D C). 2023; 6:0115.

PMID: 37287889 PMC: 10243197. DOI: 10.34133/research.0115.


References
1.
Bagnall A, Lines J, Bostrom A, Large J, Keogh E . The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances. Data Min Knowl Discov. 2019; 31(3):606-660. PMC: 6404674. DOI: 10.1007/s10618-016-0483-9. View

2.
Dobson C . Chemical space and biology. Nature. 2004; 432(7019):824-8. DOI: 10.1038/nature03192. View

3.
Beard E, Sivaraman G, Vazquez-Mayagoitia A, Vishwanath V, Cole J . Comparative dataset of experimental and computational attributes of UV/vis absorption spectra. Sci Data. 2019; 6(1):307. PMC: 6895184. DOI: 10.1038/s41597-019-0306-0. View

4.
Garcia R, Maltarollo V, Honorio K, Trossini G . Benchmark studies of UV-vis spectra simulation for cinnamates with UV filter profile. J Mol Model. 2015; 21(6):150. DOI: 10.1007/s00894-015-2689-y. View

5.
Russo D, Zorn K, Clark A, Zhu H, Ekins S . Comparing Multiple Machine Learning Algorithms and Metrics for Estrogen Receptor Binding Prediction. Mol Pharm. 2018; 15(10):4361-4370. PMC: 6181119. DOI: 10.1021/acs.molpharmaceut.8b00546. View