BERT-SIMILARITY-BASED FIREFLY (BSBF) ALGORITHM FOR MULTIPLE DOCUMENT SUMMARIZATION

ICTACT Journal on Data Science and Machine Learning ( Volume: 4 , Issue: 4 )

Abstract

Extracting knowledge from various sources is a tedious task. Multi-document Text Summarization (MDTS) aims to extrapolate data from various sources and present it in a concise, cohesive form, in a way that is simple for the reader to comprehend and yet ensures that they obtain the important information. A meta-heuristic optimization algorithm called the firefly algorithm is utilized to generate a summary. Topic Relevance Factor, Coherency Factor, and Readability Factor are used to establish the fitness function. We use BERT-based similarity to calculate these factors which are then later input to the fitness function. The experiments are conducted on DUC 2003 and DUC 2004 datasets. The suggested algorithm’s performance is compared to that of previous meta-heuristic and graph-based techniques.

Authors

Rushi Desai, Saloni Patel, Shourya Kothari, Pankaj Sonawane
Dwarkadas Jivanlal Sanghvi College of Engineering, India

Keywords

Multiple document summarization, Meta-heuristic optimization, Firefly algorithm, ROUGE, BERTscore

Published By
ICTACT
Published In
ICTACT Journal on Data Science and Machine Learning
( Volume: 4 , Issue: 4 )
Date of Publication
September 2023
Pages
520 - 525

ICT Academy is an initiative of the Government of India in collaboration with the state Governments and Industries. ICT Academy is a not-for-profit society, the first of its kind pioneer venture under the Public-Private-Partnership (PPP) model

Contact Us

ICT Academy
Module No E6 -03, 6th floor Block - E
IIT Madras Research Park
Kanagam Road, Taramani,
Chennai 600 113,
Tamil Nadu, India

For Journal Subscription: journalsales@ictacademy.in

For further Queries and Assistance, write to us at: ictacademy.journal@ictacademy.in