FEATURE SUB-SPACING BASED STACKING FOR EFFECTIVE IMBALANCE HANDLING IN SENSITIVE DATA

Abstract
Several real world classification applications suffer from an issue called data imbalance. Handling data imbalance is crucial in developing an effective classification system. This work presents an effective classifier ensemble model, Feature Sub-spacing Stacking Model (FSSM) that has been designed to operate on highly imbalanced, complex and sensitive data. The FSSM technique is based on creating subspace of features, to aid in the reduction of data complexity and also to handle data imbalance. First level trains models based on these features, which is followed by creating a stacking architecture. The second level stacking architecture trains on the predictions from the first level base models. This has enabled better and qualitative predictions. Experiments were conducted on bank data and also the NSL-KDD data. Results reveal highly effective performances compared to the existing models.

Authors
S Josephine Theresa, D J Evanjaline
Rajah Serfoji Government College, India

Keywords
Classification, Data imbalance, Ensemble, Stacking, Feature Sub-spacing
Published By :
ICTACT
Published In :
ICTACT Journal on Soft Computing
( Volume: 12 , Issue: 1 )
Date of Publication :
October 2021
DOI :

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.