Water, Vol. 17, Pages 2994: A Bibliometric-Systematic Literature Review (B-SLR) of Machine Learning-Based Water Quality Prediction: Trends, Gaps, and Future Directions

Greenberg October 17, 2025 in News - 2 Minutes

Water, Vol. 17, Pages 2994: A Bibliometric-Systematic Literature Review (B-SLR) of Machine Learning-Based Water Quality Prediction: Trends, Gaps, and Future Directions

Water doi: 10.3390/w17202994

Authors:
Jeimmy Adriana Muñoz-Alegría
Jorge Núñez
Ricardo Oyarzún
Cristian Alfredo Chávez
José Luis Arumí
Lien Rodríguez-López

Predicting the quality of freshwater, both surface and groundwater, is essential for the sustainable management of water resources. This study collected 1822 articles from the Scopus database (2000&ndash;2024) and filtered them using Topic Modeling to create the study corpus. The B-SLR analysis identified exponential growth in scientific publications since 2020, indicating that this field has reached a stage of maturity. The results showed that the predominant techniques for predicting water quality, both for surface and groundwater, fall into three main categories: (i) ensemble models, with Bagging and Boosting representing 43.07% and 25.91%, respectively, particularly random forest (RF), light gradient boosting machine (LightGBM), and extreme gradient boosting (XGB), along with their optimized variants; (ii) deep neural networks such as long short-term memory (LSTM) and convolutional neural network (CNN), which excel at modeling complex temporal dynamics; and (iii) traditional algorithms like artificial neural network (ANN), support vector machines (SVMs), and decision tree (DT), which remain widely used. Current trends point towards the use of hybrid and explainable architectures, with increased application of interpretability techniques. Emerging approaches such as Generative Adversarial Network (GAN) and Group Method of Data Handling (GMDH) for data-scarce contexts, Transfer Learning for knowledge reuse, and Transformer architectures that outperform LSTM in time series prediction tasks were also identified. Furthermore, the most studied water bodies (e.g., rivers, aquifers) and the most commonly used water quality indicators (e.g., WQI, EWQI, dissolved oxygen, nitrates) were identified. The B-SLR and Topic Modeling methodology provided a more robust, reproducible, and comprehensive overview of AI/ML/DL models for freshwater quality prediction, facilitating the identification of thematic patterns and research opportunities.

Source link

Jeimmy Adriana Muñoz-Alegría www.mdpi.com

Greenberg

Learn More →

Related Posts

Energies, Vol. 18, Pages 6240: Influence of Injection Well Location on Hydrogen Storage Capacity and Plume Migration in a Saline Aquifer: A Case Study from Central Poland

Remote Sensing, Vol. 17, Pages 3847: Investigating an Earthquake Surface Rupture Along the Kumysh Fault (Eastern Tianshan, Central Asia) from High-Resolution Topographic Data

Veterinary Sciences, Vol. 12, Pages 1130: Impact of Dietary Shrimp Waste on Physical Properties, Chemical Composition, Amino Acid Profile, and Antioxidant Levels of Breast Meat

Greenberg