WEB LINK SPAM IDENTIFICATION INSPIRED BY ARTIFICIAL IMMUNE SYSTEM AND THE IMPACT OF TPP-FCA FEATURE SELECTION ON SPAM CLASSIFICATION
Abstract
vioft2nntf2t|tblJournal|Abstract_paper|0xf4ffe45a130000009332010001000200
Search engines are the doorsteps for retrieving required information from the web. Web spam is a bad method for improving the ranking and visibility of the web pages in search engine results. This paper addresses the problem of the link spam classification through the features of the web sites. Link related features retrieved from the website are used to discriminate the spam and non-spam sites. AIS inspired algorithms are applied for the dataset and results are evaluated. Artificial immune systems are machine learning systems inspired by the principles of the natural immunology. It comprises of supervised learning schemes which can be adapted for the wide range of the classification problems.UK- WEBSPAM-2007 Dataset [8] is used for the experiments. WEKA [9] is used to simulate the classifiers. Artificial Immune Recognition algorithm seems to perform well than the other classes. Best classification accuracy attained is 98.89 by AIRS1 Algorithm. This seems to be good when comparing with the other classifiers accuracy available on the existing literature.

Authors
S. K. Jayanthi1, S. Sasikala2
Vellalar College for Women, India 1, KSR College of Arts and Science, India 2

Keywords
Web Spam, Search Engine, TPP, FCA, AIRS
Yearly Full Views
JanuaryFebruaryMarchAprilMayJuneJulyAugustSeptemberOctoberNovemberDecember
000000000000
Published By :
ICTACT
Published In :
ICTACT Journal on Soft Computing
( Volume: 4 , Issue: 1 , Pages: 633-644 )
Date of Publication :
October 2013
Page Views :
123
Full Text Views :

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.