FEATURE SUB-SPACING BASED STACKING FOR EFFECTIVE IMBALANCE HANDLING IN SENSITIVE DATA

ICTACT Journal on Soft Computing (Volume: 12, Issue: 1)

Abstract

Many real-world classification applications suffer from data imbalance, and handling it is crucial to building an effective classification system. This work presents a classifier ensemble, the Feature Sub-spacing Stacking Model (FSSM), designed to operate on highly imbalanced, complex and sensitive data. FSSM creates subspaces of the features, both to reduce data complexity and to handle data imbalance. At the first level, base models are trained on these feature subspaces. At the second level, a stacking architecture is trained on the predictions of the first-level base models, which improves the quality of the final predictions. Experiments were conducted on bank data and on the NSL-KDD data. Results show highly effective performance compared with existing models.
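The two-level idea described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not specify how subspaces are formed, which learners are used, or how imbalance is countered. The sketch assumes scikit-learn, a random disjoint partition of the feature indices, decision trees as first-level models, logistic regression as the second-level model, out-of-fold probabilities as stacking inputs, and `class_weight="balanced"` as one possible imbalance remedy. The function names `make_subspaces`, `fit_fssm_sketch` and `predict_fssm_sketch` are hypothetical, and the data is synthetic.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_predict, train_test_split
from sklearn.tree import DecisionTreeClassifier


def make_subspaces(n_features, n_subspaces, rng):
    """Assumption: subspaces are a random, disjoint partition of feature indices."""
    perm = rng.permutation(n_features)
    return [np.sort(part) for part in np.array_split(perm, n_subspaces)]


def fit_fssm_sketch(X, y, n_subspaces=4, seed=0):
    """Train one base model per feature subspace, then a meta-model on their outputs."""
    rng = np.random.default_rng(seed)
    subspaces = make_subspaces(X.shape[1], n_subspaces, rng)
    base_models, meta_columns = [], []
    for cols in subspaces:
        # Assumed base learner; class weighting is one way to counter imbalance.
        model = DecisionTreeClassifier(
            max_depth=5, class_weight="balanced", random_state=seed
        )
        # Out-of-fold probabilities keep the meta-model from seeing
        # predictions made on rows the base model was trained on.
        oof = cross_val_predict(
            model, X[:, cols], y, cv=5, method="predict_proba"
        )[:, 1]
        meta_columns.append(oof)
        base_models.append(model.fit(X[:, cols], y))
    # Second level: learn from the first-level predictions only.
    meta = LogisticRegression(class_weight="balanced")
    meta.fit(np.column_stack(meta_columns), y)
    return subspaces, base_models, meta


def predict_fssm_sketch(fitted, X):
    """Feed each base model its own subspace, then let the meta-model decide."""
    subspaces, base_models, meta = fitted
    Z = np.column_stack(
        [m.predict_proba(X[:, cols])[:, 1] for m, cols in zip(base_models, subspaces)]
    )
    return meta.predict(Z)


# Synthetic imbalanced data (roughly 5% positives), for illustration only.
X, y = make_classification(
    n_samples=2000, n_features=20, n_informative=8,
    weights=[0.95, 0.05], random_state=0,
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, stratify=y, test_size=0.25, random_state=0
)
fitted = fit_fssm_sketch(X_train, y_train)
predictions = predict_fssm_sketch(fitted, X_test)
print("minority-class recall:", (predictions[y_test == 1] == 1).mean())
```

On imbalanced data, minority-class recall (or a similar class-sensitive measure) is usually more informative than plain accuracy, which is why the sketch prints recall. Which metrics the paper itself reports is not stated in the abstract.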

Authors

S Josephine Theresa, D J Evanjaline
Rajah Serfoji Government College, India

Keywords

Classification, Data imbalance, Ensemble, Stacking, Feature Sub-spacing

Published By: ICTACT
Published In: ICTACT Journal on Soft Computing (Volume: 12, Issue: 1)
Date of Publication: October 2021
Pages: 2510-2514
