ICTACT Journals

IMPROVED FEATURE EXTRACTION ON TEXT DOCUMENTS USING NEURAL NETWORK MODEL

ICTACT Journal on Soft Computing ( Volume: 11 , Issue: 2 )

Abstract

In natural language processing, the text clustering plays a major role on reducing the text dimensionality. However, the lack of data models has made the clustering algorithm to face sparsity problems. The integration with deep learning has resolved the problem of scarce knowledge on text documents. However, deeper architectures learn such redundant features, which limit the efficiency of solutions. In this paper, a complete extraction of features from text document using neural network model. The neural network model utilizes feed forward mechanism and a type of unsupervised learning that denoises the corrupted input features. The reconstructed feature is used for initialing the feed forward network. This method reduces the manual labelling in the process of screening. For evaluation, series of experiments are conducted to investigate the performance of the method over the text datasets with various conventional algorithms.

Authors

V Kumaresan ¹, R Nagarajan²
Annamalai University, India¹, Annamalai University, India²

Keywords

Text Document, Feature Extraction, Neural Network, Denoising

Published By

ICTACT

Published In

ICTACT Journal on Soft Computing
( Volume: 11 , Issue: 2 )

Date of Publication

January 2021

Pages

2279-2282

DOI

10.21917/ijsc.2021.0325