An innovative approach for predicting groundwater TDS using optimized ensemble machine learning algorithms at two levels of modeling strategy

Hussam Eldin Elzain, Osman Abdalla*, Hamdi A. Ahmed, Anvar Kacimov, Ali Al-Maktoumi, Khalifa Al-Higgi, Mohammed Abdallah, Mohamed A. Yassin, Venkatramanan Senapathi

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

4 Citations (Scopus)

Abstract

Groundwater salinization in coastal aquifers is a major socioeconomic challenge in Oman and many other regions worldwide due to several anthropogenic activities and natural drivers. Therefore, assessing the salinization of groundwater resources is crucial to ensure the protection of water resources and sustainable management. The aim of this study is to apply a novel approach using predictive optimized ensemble trees-based (ETB) machine learning models, namely Catboost regression (CBR), Extra trees regression (ETR), and Bagging regression (BA), at two levels of modeling strategy for predicting groundwater TDS as an indicator for seawater intrusion in a coastal aquifer, Oman. At level 1, ETR and CBR models were used as base models or inputs for BA in level 2. The results show that the models at level 1 (i.e., ETR and CBR) yielded satisfactory results using a limited number of inputs (Cl, K, and Sr) from a few sets of 40 groundwater wells. The BA model at level 2 improved the overall performance of the modeling by extracting more information from ETR and CBR models at level 1 models. At level 2, the BA model achieved a significant improvement in accuracy (MSE = 0.0002, RSR = 0.062, R2 = 0.995 and NSE = 0.996) compared to each individual model of ETR (MSE = 0.0007, RSR = 0.245, R2 = 0.98 and NSE = 0.94), and CBR (MSE = 0.0035, RSR = 0.258, R2 = 0.933 and NSE = 0.934) at level 1 models in the testing dataset. BA model at level 2 outperformed all models regarding predictive accuracy, best generalization of new data, and matching the locations of the polluted and unpolluted wells. Our approach predicts groundwater TDS with high accuracy and thus provides early warnings of water quality deterioration along coastal aquifers which will improve water resources sustainability.

Original languageEnglish
Article number119896
JournalJournal of Environmental Management
Volume351
DOIs
Publication statusPublished - Feb 1 2024

Keywords

  • Coastal aquifer
  • Ensemble trees models
  • Groundwater
  • Machine learning
  • Modeling at two levels
  • TDS
  • Environmental Monitoring/methods
  • Water Pollutants, Chemical/analysis
  • Water Resources
  • Salinity
  • Seawater

ASJC Scopus subject areas

  • Environmental Engineering
  • Waste Management and Disposal
  • Management, Monitoring, Policy and Law

Cite this