A multi-model data fusion methodology for reservoir water quality based on machine learning algorithms and bayesian maximum entropy

Mohammad G. Zamani; Mohammad Reza Nikoo; Fereshteh Niknazar; Ghazi Al-Rawas; Malik Al-Wardy; Amir H. Gandomi

doi:10.1016/j.jclepro.2023.137885

A multi-model data fusion methodology for reservoir water quality based on machine learning algorithms and bayesian maximum entropy

Mohammad G. Zamani, Mohammad Reza Nikoo^*, Fereshteh Niknazar, Ghazi Al-Rawas, Malik Al-Wardy, Amir H. Gandomi

^*المؤلف المقابل لهذا العمل

نتاج البحث: المساهمة في مجلة › Article › مراجعة النظراء

11 اقتباسات (Scopus)

ملخص

A major concern in the management of reservoirs is water quality because of the negative consequences it has on both environment and human life. Artificial Intelligence (AI) concept produces a reliable framework to recognize complicated and non-linear correlations between input and output data. Although various machine learning (ML) algorithms in recent studies were employed to predict water quality variables, the existing literature lacks exploring the combination of these algorithms, which has the potential to significantly amplify the outcomes achieved by individual models. Thus, the current study aims to bridge this knowledge gap by evaluating the precision of Random Forest Regression (RFR), Support Vector Regression, Multilayer Perceptron (MLP), and Bayesian Maximum Entropy-based Fusion (BMEF) models to estimate such water quality variables as dissolved oxygen (DO) and chlorophyll-a (Chl-a). The comparisons were conducted in two primary stages: (1) a comparison of the outcomes of different ML algorithms with each other, and (2) comparing the ML algorithms' findings with that of the BMEF model, which considers uncertainty. These comparisons were evaluated using robust statistical measures, and, finally, to indicate the utility and efficacy of the newly introduced framework, it was efficiently utilized in Wadi Dayqah Dam, which is situated in Oman. The findings indicated that, throughout both training and testing phases, the BMEF model outperformed individual machine learning models, namely MLP, RFR, and SVR by 5%, 26%, and 10%, respectively, when R² and Chl-a are considered as evaluation index and water quality variables, respectively. Additionally, as the individual ML models are not capable of predicting electrical conductivity and oxidation-reduction potential efficiently, the BMEF model leads to better results by R²=0.89, which outperforms MLP (R²=0.81), RFR (R²=0.79), and SVR (R²=0.62) for oxidation-reduction potential. Regarding the study limits of the present study, spatio-temporal data should be collected over a long time to increase the data frequency and reduce the uncertainty related to climate variability.

اللغة الأصلية	English
رقم المقال	137885
دورية	Journal of Cleaner Production
مستوى الصوت	416
المعرِّفات الرقمية للأشياء	https://doi.org/10.1016/j.jclepro.2023.137885
حالة النشر	Published - سبتمبر 1 2023

ASJC Scopus subject areas

???subjectarea.asjc.2100.2105???
???subjectarea.asjc.2300.2300???
???subjectarea.asjc.1400.1408???
???subjectarea.asjc.2200.2209???

أهداف الأمم المتحدة للتنمية المستدامة

يساهم هذا المخرج في تحقيق أهداف الأمم المتحدة للتنمية المستدامة التالية (SDGs)

الوصول إلى المستند

10.1016/j.jclepro.2023.137885

الملفات والروابط الأخرى

قم بذكر هذا

A multi-model data fusion methodology for reservoir water quality based on machine learning algorithms and bayesian maximum entropy. / Zamani, Mohammad G.; Nikoo, Mohammad Reza; Niknazar, Fereshteh وآخرون.
في: Journal of Cleaner Production, المجلد 416, 137885, ٠١.٠٩.٢٠٢٣.

نتاج البحث: المساهمة في مجلة › Article › مراجعة النظراء

@article{d28e93211f874140a65596cd636987f7,

title = "A multi-model data fusion methodology for reservoir water quality based on machine learning algorithms and bayesian maximum entropy",

abstract = "A major concern in the management of reservoirs is water quality because of the negative consequences it has on both environment and human life. Artificial Intelligence (AI) concept produces a reliable framework to recognize complicated and non-linear correlations between input and output data. Although various machine learning (ML) algorithms in recent studies were employed to predict water quality variables, the existing literature lacks exploring the combination of these algorithms, which has the potential to significantly amplify the outcomes achieved by individual models. Thus, the current study aims to bridge this knowledge gap by evaluating the precision of Random Forest Regression (RFR), Support Vector Regression, Multilayer Perceptron (MLP), and Bayesian Maximum Entropy-based Fusion (BMEF) models to estimate such water quality variables as dissolved oxygen (DO) and chlorophyll-a (Chl-a). The comparisons were conducted in two primary stages: (1) a comparison of the outcomes of different ML algorithms with each other, and (2) comparing the ML algorithms' findings with that of the BMEF model, which considers uncertainty. These comparisons were evaluated using robust statistical measures, and, finally, to indicate the utility and efficacy of the newly introduced framework, it was efficiently utilized in Wadi Dayqah Dam, which is situated in Oman. The findings indicated that, throughout both training and testing phases, the BMEF model outperformed individual machine learning models, namely MLP, RFR, and SVR by 5%, 26%, and 10%, respectively, when R2 and Chl-a are considered as evaluation index and water quality variables, respectively. Additionally, as the individual ML models are not capable of predicting electrical conductivity and oxidation-reduction potential efficiently, the BMEF model leads to better results by R2=0.89, which outperforms MLP (R2=0.81), RFR (R2=0.79), and SVR (R2=0.62) for oxidation-reduction potential. Regarding the study limits of the present study, spatio-temporal data should be collected over a long time to increase the data frequency and reduce the uncertainty related to climate variability.",

keywords = "Bayesian maximum entropy fusion (BMEF), Fusion model, Machine learning models, Water quality estimation",

author = "Zamani, {Mohammad G.} and Nikoo, {Mohammad Reza} and Fereshteh Niknazar and Ghazi Al-Rawas and Malik Al-Wardy and Gandomi, {Amir H.}",

note = "Publisher Copyright: {\textcopyright} 2023 Elsevier Ltd",

year = "2023",

month = sep,

day = "1",

doi = "10.1016/j.jclepro.2023.137885",

language = "English",

volume = "416",

journal = "Journal of Cleaner Production",

issn = "0959-6526",

publisher = "Elsevier Limited",

}

TY - JOUR

T1 - A multi-model data fusion methodology for reservoir water quality based on machine learning algorithms and bayesian maximum entropy

AU - Zamani, Mohammad G.

AU - Nikoo, Mohammad Reza

AU - Niknazar, Fereshteh

AU - Al-Rawas, Ghazi

AU - Al-Wardy, Malik

AU - Gandomi, Amir H.

PY - 2023/9/1

Y1 - 2023/9/1

N2 - A major concern in the management of reservoirs is water quality because of the negative consequences it has on both environment and human life. Artificial Intelligence (AI) concept produces a reliable framework to recognize complicated and non-linear correlations between input and output data. Although various machine learning (ML) algorithms in recent studies were employed to predict water quality variables, the existing literature lacks exploring the combination of these algorithms, which has the potential to significantly amplify the outcomes achieved by individual models. Thus, the current study aims to bridge this knowledge gap by evaluating the precision of Random Forest Regression (RFR), Support Vector Regression, Multilayer Perceptron (MLP), and Bayesian Maximum Entropy-based Fusion (BMEF) models to estimate such water quality variables as dissolved oxygen (DO) and chlorophyll-a (Chl-a). The comparisons were conducted in two primary stages: (1) a comparison of the outcomes of different ML algorithms with each other, and (2) comparing the ML algorithms' findings with that of the BMEF model, which considers uncertainty. These comparisons were evaluated using robust statistical measures, and, finally, to indicate the utility and efficacy of the newly introduced framework, it was efficiently utilized in Wadi Dayqah Dam, which is situated in Oman. The findings indicated that, throughout both training and testing phases, the BMEF model outperformed individual machine learning models, namely MLP, RFR, and SVR by 5%, 26%, and 10%, respectively, when R2 and Chl-a are considered as evaluation index and water quality variables, respectively. Additionally, as the individual ML models are not capable of predicting electrical conductivity and oxidation-reduction potential efficiently, the BMEF model leads to better results by R2=0.89, which outperforms MLP (R2=0.81), RFR (R2=0.79), and SVR (R2=0.62) for oxidation-reduction potential. Regarding the study limits of the present study, spatio-temporal data should be collected over a long time to increase the data frequency and reduce the uncertainty related to climate variability.

AB - A major concern in the management of reservoirs is water quality because of the negative consequences it has on both environment and human life. Artificial Intelligence (AI) concept produces a reliable framework to recognize complicated and non-linear correlations between input and output data. Although various machine learning (ML) algorithms in recent studies were employed to predict water quality variables, the existing literature lacks exploring the combination of these algorithms, which has the potential to significantly amplify the outcomes achieved by individual models. Thus, the current study aims to bridge this knowledge gap by evaluating the precision of Random Forest Regression (RFR), Support Vector Regression, Multilayer Perceptron (MLP), and Bayesian Maximum Entropy-based Fusion (BMEF) models to estimate such water quality variables as dissolved oxygen (DO) and chlorophyll-a (Chl-a). The comparisons were conducted in two primary stages: (1) a comparison of the outcomes of different ML algorithms with each other, and (2) comparing the ML algorithms' findings with that of the BMEF model, which considers uncertainty. These comparisons were evaluated using robust statistical measures, and, finally, to indicate the utility and efficacy of the newly introduced framework, it was efficiently utilized in Wadi Dayqah Dam, which is situated in Oman. The findings indicated that, throughout both training and testing phases, the BMEF model outperformed individual machine learning models, namely MLP, RFR, and SVR by 5%, 26%, and 10%, respectively, when R2 and Chl-a are considered as evaluation index and water quality variables, respectively. Additionally, as the individual ML models are not capable of predicting electrical conductivity and oxidation-reduction potential efficiently, the BMEF model leads to better results by R2=0.89, which outperforms MLP (R2=0.81), RFR (R2=0.79), and SVR (R2=0.62) for oxidation-reduction potential. Regarding the study limits of the present study, spatio-temporal data should be collected over a long time to increase the data frequency and reduce the uncertainty related to climate variability.

KW - Bayesian maximum entropy fusion (BMEF)

KW - Fusion model

KW - Machine learning models

KW - Water quality estimation

UR - http://www.scopus.com/inward/record.url?scp=85163994852&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85163994852&partnerID=8YFLogxK

UR - https://www.mendeley.com/catalogue/e77dc309-cdab-32b7-a4fc-dd1d0e2c98c7/

U2 - 10.1016/j.jclepro.2023.137885

DO - 10.1016/j.jclepro.2023.137885

M3 - Article

AN - SCOPUS:85163994852

SN - 0959-6526

VL - 416

JO - Journal of Cleaner Production

JF - Journal of Cleaner Production

M1 - 137885

ER -