We aim to enhance the performance of several classical machine learning algorithms, including Decision Trees (DT), Logistic Regression (LR), and Support Vector Machines (SVM), on customer churn prediction tasks using CTGAN-ENN and cost-sensitive learning.
- Optimize customer churn prediction results by extending CTGAN-ENN with a cost-sensitive learning perspective
- Evaluate prediction performance using the F1-score, AUC, and G-Mean metrics
- Investigate how robust classical machine learning algorithms are in this area
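The three evaluation metrics named above can be illustrated with a minimal, dependency-free sketch. The function names and the toy labels and scores are assumptions made for illustration only; they are not taken from the project's code or results. G-Mean is computed as the geometric mean of sensitivity and specificity, and AUC as the fraction of positive/negative pairs that are ranked correctly.

```python
import math


def confusion_counts(y_true, y_pred):
    """Return (tp, fp, tn, fn) for binary labels where 1 = churn."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return tp, fp, tn, fn


def f1_score(y_true, y_pred):
    """F1 = 2TP / (2TP + FP + FN)."""
    tp, fp, _, fn = confusion_counts(y_true, y_pred)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def g_mean(y_true, y_pred):
    """Geometric mean of sensitivity (TPR) and specificity (TNR)."""
    tp, fp, tn, fn = confusion_counts(y_true, y_pred)
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    return math.sqrt(sensitivity * specificity)


def auc_roc(y_true, scores):
    """Share of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one sample of each class")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical imbalanced toy data: 2 churners, 6 non-churners.
y_true = [1, 1, 0, 0, 0, 0, 0, 0]
scores = [0.9, 0.4, 0.6, 0.3, 0.2, 0.1, 0.1, 0.05]
y_pred = [1 if s >= 0.5 else 0 for s in scores]

print("F1:", f1_score(y_true, y_pred))
print("G-Mean:", g_mean(y_true, y_pred))
print("AUC:", auc_roc(y_true, scores))
```

F1 and G-Mean depend on the chosen decision threshold, while AUC only depends on how the scores rank the samples, which is one reason the three are usually reported together on imbalanced churn data.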
The CostLearnGAN framework surpasses CTGAN-ENN on average on the AUC-ROC, F1-score, and G-Mean evaluation metrics. The results also show that CTGAN-ENN-CS was more robust than CTGAN-ENN across all classical machine learning algorithms.
- Cost-sensitive learning improved the hybrid sampling method on classical machine learning algorithms (DT, SVM, LR)
- CostLearnGAN improved customer churn prediction performance on the AUC, F1-score, and G-Mean metrics
- CostLearnGAN showed the most robust performance across all algorithms
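To give an intuition for why cost-sensitive learning can help on imbalanced churn data, the sketch below applies one generic cost-sensitive technique: moving the decision threshold according to misclassification costs. This is an illustrative assumption and not necessarily the mechanism used in CostLearnGAN; the cost values, function names, and toy probabilities are all hypothetical. When correct predictions cost nothing, predicting churn minimizes expected cost whenever the churn probability is at least `cost_fp / (cost_fp + cost_fn)`.

```python
def cost_sensitive_threshold(cost_fp, cost_fn):
    """Threshold that minimizes expected cost when correct predictions cost 0."""
    if cost_fp <= 0 or cost_fn <= 0:
        raise ValueError("costs must be positive")
    return cost_fp / (cost_fp + cost_fn)


def predict_churn(probabilities, cost_fp=1.0, cost_fn=5.0):
    """Label a customer as churn (1) when the probability reaches the threshold."""
    threshold = cost_sensitive_threshold(cost_fp, cost_fn)
    return [1 if p >= threshold else 0 for p in probabilities]


def total_cost(y_true, y_pred, cost_fp=1.0, cost_fn=5.0):
    """Sum the misclassification costs over all predictions."""
    cost = 0.0
    for t, p in zip(y_true, y_pred):
        if t == 0 and p == 1:
            cost += cost_fp  # false alarm: retention offer sent needlessly
        elif t == 1 and p == 0:
            cost += cost_fn  # missed churner: customer lost
    return cost


# Hypothetical churn probabilities from any classifier (DT, LR, SVM, ...).
y_true = [1, 1, 1, 0, 0, 0]
probs = [0.9, 0.3, 0.2, 0.6, 0.1, 0.05]

default_pred = [1 if p >= 0.5 else 0 for p in probs]
cost_pred = predict_churn(probs, cost_fp=1.0, cost_fn=5.0)

print("cost at 0.5 threshold:", total_cost(y_true, default_pred))
print("cost at cost-sensitive threshold:", total_cost(y_true, cost_pred))
```

In scikit-learn, a related effect is commonly obtained during training through the `class_weight` parameter of `DecisionTreeClassifier`, `LogisticRegression`, and `SVC`, which weights errors on the minority class more heavily.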
Experiment with other hybrid combinations built on CTGAN, such as adding an anomaly detection method to CTGAN to ensure that the synthetic data produced are not outliers.
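As a rough illustration of this idea, the sketch below filters synthetic rows with a simple per-column z-score check against the real data. The z-score rule is a deliberately simple stand-in chosen for illustration, not a method proposed or evaluated in this work, and the function names, threshold, and toy rows are all hypothetical. A dedicated anomaly detector could take the place of `is_outlier`.

```python
import statistics


def fit_column_stats(real_rows):
    """Compute (mean, population std) for each numeric column of the real data."""
    columns = list(zip(*real_rows))
    return [(statistics.mean(col), statistics.pstdev(col)) for col in columns]


def is_outlier(row, stats, z_max=3.0):
    """Flag a row if any column lies more than z_max std devs from the real mean."""
    for value, (mean, std) in zip(row, stats):
        if std == 0:
            if value != mean:
                return True  # constant column in real data, any deviation is odd
            continue
        if abs(value - mean) / std > z_max:
            return True
    return False


def filter_synthetic(synthetic_rows, real_rows, z_max=3.0):
    """Keep only synthetic rows that are not flagged as outliers."""
    stats = fit_column_stats(real_rows)
    return [row for row in synthetic_rows if not is_outlier(row, stats, z_max)]


# Hypothetical real minority-class rows: (monthly_charge, usage_ratio).
real = [[10, 1.0], [12, 1.2], [11, 0.9], [13, 1.1], [9, 1.0]]
# Hypothetical generator output; the last two rows are implausible.
synthetic = [[11, 1.05], [40, 1.0], [12, 5.0]]

kept = filter_synthetic(synthetic, real)
print("kept synthetic rows:", kept)
```

Such a filter would run after the generator produces samples and before any cleaning or training step, so that only plausible synthetic churners enter the training set.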
This research was conducted as part of the ASEAN GMS grant and the AIDA (Applied Intelligence and Data Analytics) lab in the College of Computing, Khon Kaen University, Thailand. This study was also conducted in collaboration with Rebecca Lab, Feng Chia University, Taiwan.
@misc{costlearngan,
author = {I Nyoman Mahayasa Adiputra and Paweena Wanchai and Pei-Chun Lin},
title = {Optimized customer churn prediction using tabular generative adversarial network (GAN)-based hybrid sampling method and cost-sensitive learning},
year = {2025},
url = {https://doi.org/10.7717/peerj-cs.2949}
}


