ResearchSpace

Long short term memory water quality predictive model discrepancy mitigation through genetic algorithm optimisation and ensemble modeling

Show simple item record

dc.contributor.author Dheda, D
dc.contributor.author Cheng, L
dc.contributor.author Abu-Mahfouz, Adnan MI
dc.date.accessioned 2022-11-21T08:44:50Z
dc.date.available 2022-11-21T08:44:50Z
dc.date.issued 2022-02
dc.identifier.citation Dheda, D., Cheng, L. & Abu-Mahfouz, A.M. 2022. Long short term memory water quality predictive model discrepancy mitigation through genetic algorithm optimisation and ensemble modeling. <i>IEEE Access.</i> http://hdl.handle.net/10204/12530 en_ZA
dc.identifier.issn 2169-3536
dc.identifier.uri DOI: 10.1109/ACCESS.2022.3152818
dc.identifier.uri http://hdl.handle.net/10204/12530
dc.description.abstract A predictive long short-term memory (LSTM) model developed on a particular water quality dataset will only apply to the dataset and may fail to make an accurate prediction on another dataset. This paper focuses on improving LSTM model tolerance by mitigating discrepancies in model prediction capability that arises when a model is applied to different datasets. Two predictive LSTM models are developed from the water quality datasets, Baffle and Burnett, and are optimised using the metaheuristic genetic algorithm (GA) to create hybrid GA-optimised LSTM models that are subsequently combined with a linear weight-based technique to develop a tolerant predictive ensemble model. The models successfully predict river water quality in terms of dissolved oxygen concentration. After GA-optimisation, the RMSE values of the Baffle and Burnett models decrease by 42.42% and 10.71%, respectively. Furthermore, two ensemble models are developed from the GA-hybrid models, namely the average ensemble and the optimal weighted ensemble. The GA-Baffle RMSE values decrease by 5.05% for the average ensemble and 6.06% for the weighted ensemble, and the GA-Burnett RMSE values decrease by 7.84% and 8.82%, respectively. When tested on unseen and unrelated datasets, the models make accurate predictions, indicating the applicability of the models in domains outside the water sector. The consistent and similar performance of the models on any dataset illustrates the successful mitigation of discrepancies in the predictive capacity of individual LSTM models by the proposed ensemble scheme. The observed model performance highlights the datasets on which the models could potentially make accurate predictions. en_US
dc.format Fulltext en_US
dc.language.iso en en_US
dc.relation.uri https://ieeexplore.ieee.org/document/9717258 en_US
dc.source IEEE Access en_US
dc.subject Long short-term memory en_US
dc.subject LSTM en_US
dc.subject Genetic algorithm en_US
dc.subject Ensemble model en_US
dc.subject Weight based model fusion en_US
dc.subject Water conservation en_US
dc.title Long short term memory water quality predictive model discrepancy mitigation through genetic algorithm optimisation and ensemble modeling en_US
dc.type Article en_US
dc.description.pages 24638 - 24658 en_US
dc.description.note This work is licensed under a Creative Commons Attribution 4.0 License en_US
dc.description.cluster Next Generation Enterprises & Institutions en_US
dc.description.impactarea EDT4IR Management en_US
dc.identifier.apacitation Dheda, D., Cheng, L., & Abu-Mahfouz, A. M. (2022). Long short term memory water quality predictive model discrepancy mitigation through genetic algorithm optimisation and ensemble modeling. <i>IEEE Access</i>, http://hdl.handle.net/10204/12530 en_ZA
dc.identifier.chicagocitation Dheda, D, L Cheng, and Adnan MI Abu-Mahfouz "Long short term memory water quality predictive model discrepancy mitigation through genetic algorithm optimisation and ensemble modeling." <i>IEEE Access</i> (2022) http://hdl.handle.net/10204/12530 en_ZA
dc.identifier.vancouvercitation Dheda D, Cheng L, Abu-Mahfouz AM. Long short term memory water quality predictive model discrepancy mitigation through genetic algorithm optimisation and ensemble modeling. IEEE Access. 2022; http://hdl.handle.net/10204/12530. en_ZA
dc.identifier.ris TY - Article AU - Dheda, D AU - Cheng, L AU - Abu-Mahfouz, Adnan MI AB - A predictive long short-term memory (LSTM) model developed on a particular water quality dataset will only apply to the dataset and may fail to make an accurate prediction on another dataset. This paper focuses on improving LSTM model tolerance by mitigating discrepancies in model prediction capability that arises when a model is applied to different datasets. Two predictive LSTM models are developed from the water quality datasets, Baffle and Burnett, and are optimised using the metaheuristic genetic algorithm (GA) to create hybrid GA-optimised LSTM models that are subsequently combined with a linear weight-based technique to develop a tolerant predictive ensemble model. The models successfully predict river water quality in terms of dissolved oxygen concentration. After GA-optimisation, the RMSE values of the Baffle and Burnett models decrease by 42.42% and 10.71%, respectively. Furthermore, two ensemble models are developed from the GA-hybrid models, namely the average ensemble and the optimal weighted ensemble. The GA-Baffle RMSE values decrease by 5.05% for the average ensemble and 6.06% for the weighted ensemble, and the GA-Burnett RMSE values decrease by 7.84% and 8.82%, respectively. When tested on unseen and unrelated datasets, the models make accurate predictions, indicating the applicability of the models in domains outside the water sector. The consistent and similar performance of the models on any dataset illustrates the successful mitigation of discrepancies in the predictive capacity of individual LSTM models by the proposed ensemble scheme. The observed model performance highlights the datasets on which the models could potentially make accurate predictions. DA - 2022-02 DB - ResearchSpace DP - CSIR J1 - IEEE Access KW - Long short-term memory KW - LSTM KW - Genetic algorithm KW - Ensemble model KW - Weight based model fusion KW - Water conservation LK - https://researchspace.csir.co.za PY - 2022 SM - 2169-3536 T1 - Long short term memory water quality predictive model discrepancy mitigation through genetic algorithm optimisation and ensemble modeling TI - Long short term memory water quality predictive model discrepancy mitigation through genetic algorithm optimisation and ensemble modeling UR - http://hdl.handle.net/10204/12530 ER - en_ZA
dc.identifier.worklist 26132 en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record