Seismic hazard is the likelihood that an earthquake will occur in a specific location, within a specific time frame, with ground-motion intensity exceeding a given threshold. Predicting such hazards is crucial because it enables early warnings that can lessen their negative effects. Machine learning research is under way to predict seismic events from previously recorded incidents. However, because these events happen so infrequently, the data present a class imbalance problem for machine learning and deep learning models. This study therefore compares the performance of popular over-sampling techniques that seek to even out class imbalance in seismic events data. Specifically, this work applied SMOTE, SMOTENC, SMOTEN, BorderlineSMOTE, SVMSMOTE, and ADASYN to the open-source Seismic Bumps dataset and then trained several machine learning classifiers with stratified K-fold cross-validation for seismic hazard detection. SVMSMOTE was the best over-sampling method: it produced classifiers with the highest overall accuracy, F1 score, recall, and precision, each reaching 100%. ADASYN, by contrast, showed the lowest performance on every reported metric across all the models. To our knowledge, no previous research has compared the effectiveness of these over-sampling techniques for tasks involving seismic events.
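To make the over-sampling step concrete, the following is a minimal sketch of the interpolation idea behind plain SMOTE, combined with a stratified fold split. It is not the study's implementation: the function names (`smote`, `stratified_folds`, `balance_training_fold`), the toy data, and the choice to over-sample only the training folds are all assumptions made for illustration, and the other variants named above (SMOTENC, SMOTEN, BorderlineSMOTE, SVMSMOTE, ADASYN) are not covered.

```python
import math
import random


def smote(minority, n_new, k=5, seed=0):
    """Create n_new synthetic points by interpolating between a randomly
    chosen minority sample and one of its k nearest minority neighbours."""
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority samples by Euclidean distance to the base.
        others = [p for j, p in enumerate(minority) if j != i]
        others.sort(key=lambda p: math.dist(base, p))
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


def stratified_folds(labels, n_splits=5, seed=0):
    """Split indices into folds that keep roughly the original class ratio."""
    rng = random.Random(seed)
    folds = [[] for _ in range(n_splits)]
    for cls in sorted(set(labels)):
        idx = [i for i, y in enumerate(labels) if y == cls]
        rng.shuffle(idx)
        for pos, i in enumerate(idx):
            folds[pos % n_splits].append(i)
    return folds


def balance_training_fold(X, y, train_idx, minority_label=1, k=5, seed=0):
    """Over-sample the minority class in the training indices only
    (an assumed precaution; the abstract does not state the order of steps)."""
    X_train = [X[i] for i in train_idx]
    y_train = [y[i] for i in train_idx]
    minority = [x for x, lab in zip(X_train, y_train) if lab == minority_label]
    n_majority = len(y_train) - len(minority)
    extra = smote(minority, max(0, n_majority - len(minority)), k=k, seed=seed)
    return X_train + extra, y_train + [minority_label] * len(extra)


# Hypothetical toy data: 90 "non-hazardous" rows (label 0), 10 "hazardous" rows (label 1).
X = [[float(i), 0.0] for i in range(90)] + [[100.0 + i, 5.0] for i in range(10)]
y = [0] * 90 + [1] * 10

folds = stratified_folds(y, n_splits=5)
train_idx = [i for fold in folds[1:] for i in fold]  # fold 0 is held out
X_bal, y_bal = balance_training_fold(X, y, train_idx)
```

Each synthetic point lies on the line segment between two real minority samples, so the minority class grows without exact duplicates. Keeping the held-out fold untouched means evaluation is done on real observations only, which is one reasonable way to read reported metrics on imbalanced data.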
Reference:
Mokoatle, Mpho, Toshka Coleman, and Paul M Mokilane. "A comparative study of over-sampling techniques as applied to seismic events." The Southern African Conference on AI Research (SACAIR 2023), Muldersdrift, Gauteng, 4-8 December 2023 (2023): http://hdl.handle.net/10204/13445