VIRTUAL DISK BASED DATA CLUSTERING MAPREDUCE FRAMEWORK

ICTACT Journal on Data Science and Machine Learning ( Volume: 1 , Issue: 2 )

Abstract

vioft2nntf2t|tblJournal|Abstract_paper|0xf4ffc7952b000000ca4b000001000b00
One of the important methods in data mining is segmentation. As each field is extended and digitized, large data sets are developed quickly. These wide clustering of data sets poses a problem for conventional sequential segmentation algorithms due to the enormous time consumed for development. Hence, distributed parallel architecture and algorithms are useful in meeting the efficiency and scalability requirements for clustering large data sets. In this analysis, we use MapReduce programming model to develop and experiment a parallel SVM algorithm, and compare the result with concurrent SVM for clustering the changing document database size. The result shows that the SVM proposed get better performance than existing methods.

Authors

M Vijayalakshmi, T Vinodh Kannan
Mookambigai College of Engineering, India

Keywords

Cloud Computing, Map Reduce, Clustering, Domain Clustering

Published By
ICTACT
Published In
ICTACT Journal on Data Science and Machine Learning
( Volume: 1 , Issue: 2 )
Date of Publication
March 2020
Pages
64-67
DOI

ICT Academy is an initiative of the Government of India in collaboration with the state Governments and Industries. ICT Academy is a not-for-profit society, the first of its kind pioneer venture under the Public-Private-Partnership (PPP) model

Contact Us

ICT Academy
Module No E6 -03, 6th floor Block - E
IIT Madras Research Park
Kanagam Road, Taramani,
Chennai 600 113,
Tamil Nadu, India

For Journal Subscription: journalsales@ictacademy.in

For further Queries and Assistance, write to us at: ictacademy.journal@ictacademy.in