» Articles » PMID: 36143479

Machine Learning Models for Identification and Prediction of Toxic Organic Compounds Using Transcriptomic Profiles

Overview
Journal Life (Basel)
Specialty Biology
Date 2022 Sep 23
PMID 36143479
Authors
Affiliations
Soon will be listed here.
Abstract

A wide range of environmental factors heavily impact aquatic ecosystems, in turn, affecting human health. Toxic organic compounds resulting from anthropogenic activity are a source of pollution in aquatic ecosystems. To evaluate these contaminants, current approaches mainly rely on acute and chronic toxicity tests, but cannot provide explicit insights into the causes of toxicity. As an alternative, genome-wide gene expression systems allow the identification of contaminants causing toxicity by monitoring the organisms' response to toxic substances. In this study, we selected 22 toxic organic compounds, classified as pesticides, herbicides, or industrial chemicals, that induce environmental problems in aquatic ecosystems and affect human-health. To identify toxic organic compounds using gene expression data from , we evaluated the performance of three machine learning based feature-ranking algorithms (Learning Vector Quantization, Random Forest, and Support Vector Machines with a Linear kernel), and nine classifiers (Linear Discriminant Analysis, Classification And Regression Trees, K-nearest neighbors, Support Vector Machines with a Linear kernel, Random Forest, Boosted C5.0, Gradient Boosting Machine, eXtreme Gradient Boosting with tree, and eXtreme Gradient Boosting with DART booster). Our analysis revealed that a combination of feature selection based on feature-ranking and a random forest classification algorithm had the best model performance, with an accuracy of 95.7%. This is a preliminary study to establish a model for the monitoring of aquatic toxic substances by machine learning. This model could be an effective tool to manage contaminants and toxic organic compounds in aquatic systems.

Citing Articles

Characterization and Expression of the Cytochrome P450 Genes in Exposed to Cerium Oxide Nanoparticles.

Kang X, Zhou Y, Liu Q, Liu M, Chen J, Zhang Y Int J Mol Sci. 2024; 25(19).

PMID: 39409143 PMC: 11476439. DOI: 10.3390/ijms251910812.


Systematic approaches to machine learning models for predicting pesticide toxicity.

Anandhi G, Iyapparaja M Heliyon. 2024; 10(7):e28752.

PMID: 38576573 PMC: 10990867. DOI: 10.1016/j.heliyon.2024.e28752.

References
1.
Zhou J, Du N, Li D, Qin J, Li H, Chen G . Combined effects of perchlorate and hexavalent chromium on the survival, growth and reproduction of Daphnia carinata. Sci Total Environ. 2021; 769:144676. DOI: 10.1016/j.scitotenv.2020.144676. View

2.
Poynton H, Varshavsky J, Chang B, Cavigiolio G, Chan S, Holman P . Daphnia magna ecotoxicogenomics provides mechanistic insights into metal toxicity. Environ Sci Technol. 2007; 41(3):1044-50. DOI: 10.1021/es0615573. View

3.
Poynton H, Loguinov A, Varshavsky J, Chan S, Perkins E, Vulpe C . Gene expression profiling in Daphnia magna part I: concentration-dependent profiles provide support for the No Observed Transcriptional Effect Level. Environ Sci Technol. 2008; 42(16):6250-6. DOI: 10.1021/es8010783. View

4.
Jeremias G, Jesus F, Ventura S, Goncalves F, Asselman J, Pereira J . New insights on the effects of ionic liquid structural changes at the gene expression level: Molecular mechanisms of toxicity in Daphnia magna. J Hazard Mater. 2020; 409:124517. DOI: 10.1016/j.jhazmat.2020.124517. View

5.
Barrett T, Wilhite S, Ledoux P, Evangelista C, Kim I, Tomashevsky M . NCBI GEO: archive for functional genomics data sets--update. Nucleic Acids Res. 2012; 41(Database issue):D991-5. PMC: 3531084. DOI: 10.1093/nar/gks1193. View