MULTI-DOCUMENT TEXT SUMMARIZATION USING CLUSTERING TECHNIQUES AND LEXICAL CHAINING

ICTACT Journal on Soft Computing (Volume: 1, Issue: 1)

Abstract

This paper investigates the use of clustering and lexical chains to produce coherent, indicative, and less redundant summaries of multiple text documents. The summary length follows the user's requirement for conciseness: the documents are summarized to the percentage specified by the user. To achieve this, various clustering techniques are used. Clustering is performed at two levels, first at the single-document level and then at the multi-document level. The clustered sentences are scored using five different methods and lexically linked to produce the final summary as a text document.
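
As a rough illustration of the percentage-driven, cluster-then-select idea described in the abstract, the following is a minimal sketch and not the authors' implementation. It assumes average-link agglomerative clustering over Jaccard word overlap, and it uses a single placeholder score in place of the five scoring methods, which the abstract does not specify. Lexical chaining and the second (multi-document) clustering level are omitted. All function names are hypothetical.

```python
import math
import re


def tokenize(sentence):
    """Lowercase word set for a sentence (a crude stand-in for real preprocessing)."""
    return set(re.findall(r"[a-z]+", sentence.lower()))


def jaccard(a, b):
    """Word-overlap similarity between two token sets."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def cluster_sentences(sentences, target_clusters):
    """Average-link agglomerative clustering; returns lists of sentence indices."""
    tokens = [tokenize(s) for s in sentences]
    clusters = [[i] for i in range(len(sentences))]

    def similarity(c1, c2):
        pairs = [(i, j) for i in c1 for j in c2]
        return sum(jaccard(tokens[i], tokens[j]) for i, j in pairs) / len(pairs)

    while len(clusters) > target_clusters:
        # Merge the most similar pair of clusters.
        a, b = max(
            ((x, y) for x in range(len(clusters)) for y in range(x + 1, len(clusters))),
            key=lambda xy: similarity(clusters[xy[0]], clusters[xy[1]]),
        )
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return clusters


def summarize(sentences, percentage):
    """Keep roughly `percentage` percent of the sentences, one per cluster."""
    if not sentences:
        return []
    keep = min(len(sentences), max(1, math.ceil(len(sentences) * percentage / 100)))
    tokens = [tokenize(s) for s in sentences]
    chosen = []
    for cluster in cluster_sentences(sentences, keep):
        # Placeholder score (an assumption): total similarity to the other
        # members of the cluster, so the most "central" sentence is kept.
        def score(i):
            return sum(jaccard(tokens[i], tokens[j]) for j in cluster if j != i)

        chosen.append(max(cluster, key=score))
    # Restore original sentence order for readability.
    return [sentences[i] for i in sorted(chosen)]
```

Calling `summarize(sentences, 50)` on a list of four sentences would return two of them, one drawn from each of the two clusters. To approximate the two-level scheme, the same routine could be applied first to each document and then to the pooled per-document output.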

Authors

S. Saraswathi (1), R. Arti (2)
(1) Pondicherry Engineering College, Pondicherry, India; (2) Microsoft R & D India Private Limited, Hyderabad, India

Keywords

Hierarchical Clustering, Lexical Chaining, Precision, Recall

Published By
ICTACT
Date of Publication
July 2010
Pages
23 - 29
