MULTI-DOCUMENT TEXT SUMMARIZATION USING CLUSTERING TECHNIQUES AND LEXICAL CHAINING

ICTACT Journal on Soft Computing (Volume: 1, Issue: 1)

Abstract

This paper investigates the use of clustering and lexical chains to produce coherent, indicative, and less redundant summaries of multiple text documents. The summary length follows the user's conciseness requirement: the documents are summarized to the percentage specified by the user. To achieve this, various clustering techniques are used. Clustering is performed at two levels, first at the single-document level and then at the multi-document level. The clustered sentences are scored using five different methods and lexically linked to produce the final summary as a text document.
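
The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the clustering algorithms, the five scoring methods, or the lexical chaining procedure, so everything below is an assumption made for illustration. The sketch assumes a greedy word-overlap (Jaccard) grouping with a hypothetical threshold of 0.3, uses cluster size as a stand-in for sentence scoring, and omits lexical chaining entirely. The names `tokenize`, `jaccard`, `cluster_sentences`, and `summarize` are hypothetical.

```python
import math
import re


def tokenize(sentence):
    """Lowercase word tokens; a stand-in for real preprocessing."""
    return set(re.findall(r"[a-z]+", sentence.lower()))


def jaccard(a, b):
    """Word-overlap similarity between two token sets (assumed measure)."""
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def cluster_sentences(sentences, threshold=0.3):
    """Greedy grouping: a sentence joins the first cluster that contains
    a sufficiently similar sentence, otherwise it starts a new cluster."""
    tokens = [tokenize(s) for s in sentences]
    clusters = []  # each cluster is a list of sentence indices
    for i, tok in enumerate(tokens):
        for cluster in clusters:
            if any(jaccard(tok, tokens[j]) >= threshold for j in cluster):
                cluster.append(i)
                break
        else:
            clusters.append([i])
    return clusters


def split_sentences(text):
    """Naive split on sentence-ending punctuation followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def summarize(documents, percent):
    """Pick up to ceil(percent % of all sentences) sentences,
    one representative per multi-document cluster."""
    total = 0
    representatives = []
    for doc in documents:
        sentences = split_sentences(doc)
        total += len(sentences)
        # Level 1: cluster within a single document, keep the first
        # sentence of each cluster as its representative.
        for cluster in cluster_sentences(sentences):
            representatives.append(sentences[cluster[0]])
    budget = max(1, math.ceil(total * percent / 100))
    # Level 2: cluster the representatives across all documents.
    clusters = cluster_sentences(representatives)
    # Placeholder score (assumption): larger clusters rank first, since
    # their content recurs across documents. The sort is stable.
    clusters.sort(key=len, reverse=True)
    chosen = sorted(cluster[0] for cluster in clusters[:budget])
    return [representatives[i] for i in chosen]
```

Calling `summarize(documents, 40)` on a list of plain-text strings returns a list of sentences covering at most 40% of the input sentence count. Because each cluster contributes only one sentence, near-duplicate sentences from different documents appear once, which is the redundancy-reduction idea the abstract refers to.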

Authors

S. Saraswathi¹, R. Arti²
¹Pondicherry Engineering College, Pondicherry, India; ²Microsoft R & D India Private Limited, Hyderabad, India

Keywords

Hierarchical Clustering, Lexical Chaining, Precision, Recall

Published By
ICTACT
Date of Publication
July 2010
Pages
23 - 29