» Articles » PMID: 39843505

Comparative Analysis of Correlation and Causality Inference in Water Quality Problems with Emphasis on TDS Karkheh River in Iran

Overview
Journal Sci Rep
Date 2025 Jan 22
PMID 39843505
Authors
Affiliations
Soon will be listed here.
Abstract

Water quality management is a critical aspect of environmental sustainability, particularly in arid and semi-arid regions such as Iran where water scarcity is compounded by quality degradation. This study delves into the causal relationships influencing water quality, focusing on Total Dissolved Solids (TDS) as a primary indicator in the Karkheh River, southwest Iran. Utilizing a comprehensive dataset spanning 50 years (1968-2018), this research integrates Machine Learning (ML) techniques to examine correlations and infer causality among multiple parameters, including flow rate (Q), Sodium (Na), Magnesium (Mg), Calcium (Ca), Chloride (Cl), Sulfate (SO), Bicarbonates (HCO), and pH. For modeling the causation, the "Back door linear regression" approach has been considered which establishes a stable and interpretable framework in causal inference by focusing on clear assumptions. Predictive modeling was used to show the difference between correlation and causation along with interpretability modeling to make the predictive model transparent. Predictive modeling does not report the causality among the variables as it showed Mg is not contributing to the target (TDS) while the findings reveal that TDS is predominantly positive influenced by Mg, Na, Cl, Ca and SO, with HCO and pH exerting negative (inverse) effects. Unlike correlations, causal relationships demonstrate directional and often unequal influences, highlighting Mg as a critical driver of TDS levels. This novel application of ML-based causal inference in water quality research provides a cost-effective and time-efficient alternative to traditional experimental methods. The results underscore the potential of ML-driven causal analysis to guide water resource management and policy-making. By identifying the key drivers of TDS, this study proposes targeted interventions to mitigate water quality deterioration. Moreover, the insights gained lay the foundation for developing early warning systems, ensuring proactive and sustainable water quality management in similar hydrological contexts.

References
1.
Shakeri R, Amini H, Fakheri F, Ketabchi H . Assessment of drought conditions and prediction by machine learning algorithms using Standardized Precipitation Index and Standardized Water-Level Index (case study: Yazd province, Iran). Environ Sci Pollut Res Int. 2023; 30(45):101744-101760. DOI: 10.1007/s11356-023-29522-5. View

1.
Zhang R, Pu L, Li J, Zhang J, Xu Y . Landscape ecological security response to land use change in the tidal flat reclamation zone, China. Environ Monit Assess. 2015; 188(1):1. DOI: 10.1007/s10661-015-4999-z. View

2.
Zhu M, Wang J, Yang X, Zhang Y, Zhang L, Ren H . A review of the application of machine learning in water quality evaluation. Eco Environ Health. 2023; 1(2):107-116. PMC: 10702893. DOI: 10.1016/j.eehl.2022.06.001. View

3.
Suter 2nd G, Cormier S . A method for assessing the potential for confounding applied to ionic strength in central Appalachian streams. Environ Toxicol Chem. 2012; 32(2):288-95. DOI: 10.1002/etc.2054. View

4.
Samuel R . Total Dissolved Solids (TDS) Less than 1000 ppm in Drinking Water Did Not Impact Nursery Pig Performance. Vet Sci. 2022; 9(11). PMC: 9695767. DOI: 10.3390/vetsci9110622. View