Big Data is now among the most talked-about research subjects. Over the years, with the expansion of the internet and of storage capacity, vast swaths of data have become available to the would-be searcher. About a decade ago, because relatively little content existed, a search often returned an accurate set of results. Nowadays, however, most of the data, if not all, is at times vague and sometimes does not even pertain to the area the search was intended for. Hence, a novel approach is presented here that performs data cleaning using a simple but effective fuzzy rule to weed out data that will not produce accurate results.