Title: Ensemble classifiers for bankruptcy prediction using SMOTE and RFECV
Authors: T. Shahana; Vilvanathan Lavanya; Aamir Rashid Bhat
Addresses: Department of Management Studies, National Institute of Technology Tiruchirappalli, Thuvakkudi, Trichy, 620015, India ' Department of Management Studies, National Institute of Technology Tiruchirappalli, Thuvakkudi, Trichy, 620015, India ' Department of Corporate Secretaryship Accounting and Finance, SRM Institute of Science and Technology, Kattankulathur, Chennai, 603203, India
Abstract: This research investigates the impact of preprocessing strategies, namely feature selection (utilising correlation and recursive feature elimination with cross-validation) and class imbalance handling (employing synthetic minority oversampling technique), on the performance of prediction models using ensemble-learning techniques (random forest, AdaBoost, gradient boosting decision tree, extreme gradient boosting, bagging, LightGBM and extra tree classifier). The study focuses on the Polish bankruptcy dataset to assess the effectiveness of these preprocessing approaches. Experimental results demonstrate that adopting class imbalance handling significantly influences classifier performance compared to feature selection alone. Interestingly, hyperparameter tuning and feature selection exhibit limited impact on classifier performance. Among the ensemble-learning techniques tested, the adaptive boosting classifier shows consistently poor performance throughout the study period, followed by the bagging classifier with statistical significance. These findings shed light on the importance of selecting appropriate preprocessing strategies to improve the performance of ensemble-based prediction models in bankruptcy prediction tasks.
Keywords: bankruptcy prediction; ensemble classifiers; missing value imputation; SMOTE; correlation; RFECV.
DOI: 10.1504/IJENM.2024.137456
International Journal of Enterprise Network Management, 2024 Vol.15 No.1, pp.109 - 132
Received: 30 Mar 2022
Accepted: 06 Jul 2023
Published online: 19 Mar 2024 *