CLUSTERING CATEGORICAL DATA USING k-MODES BASED ON CUCKOO SEARCH OPTIMIZATION ALGORITHM

Abstract
Cluster analysis is the unsupervised learning technique that finds the interesting patterns in the data objects without knowing class labels. Most of the real world dataset consists of categorical data. For example, social media analysis may have the categorical data like the gender as male or female. The k-modes clustering algorithm is the most widely used to group the categorical data, because it is easy to implement and efficient to handle the large amount of data. However, due to its random selection of initial centroids, it provides the local optimum solution. There are number of optimization algorithms are developed to obtain global optimum solution. Cuckoo Search algorithm is the population based metaheuristic optimization algorithms to provide the global optimum solution. Methods: In this paper, k-modes clustering algorithm is combined with Cuckoo Search algorithm to obtain the global optimum solution. Results: Experiments are conducted with benchmark datasets and the results are compared with k-modes and Particle Swarm Optimization with k-modes to prove the efficiency of the proposed algorithm.

Authors
K Lakshmi1, N Karthikeyani Visalakshi2, S Shanthi3, S Parvathavarthini4
Kongu Engineering College, India1,3,4, NKR Government Arts College for Women, India2

Keywords
Cluster Analysis, k-Modes, Cuckoo Search Optimization, Local Optima, Initial Centroids
Published By :
ICTACT
Published In :
ICTACT Journal on Soft Computing
( Volume: 8 , Issue: 1 )
Date of Publication :
October 2017

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.