HIGH-EFFICIENCY AERIAL DETECTION WITH VISION TRANSFORMERS AND PYSPARK IN A DISTRIBUTED FRAMEWORK

ICTACT Journal on Data Science and Machine Learning ( Volume: 6 , Issue: 1 )

Abstract

Image processing tasks in Computer vision such as segmentation, object detection, and classification are foundational in geosciences, driving advancements by enabling precise analysis and interpretation of Earth’s dynamic systems and landscapes. The processing of high-resolution aerial images for object detection presents significant challenges, including the need for high detection accuracy and the ability to handle vast datasets effectively. Traditional methods often struggle with the scale and complexity of such tasks, necessitating innovations that can leverage distributed computing to meet these demands. This study introduces a groundbreaking framework that integrates Vision Transformers, a cutting-edge architecture for object detection, with PySpark’s distributed computing capabilities. This inference model significantly enhances batch inference processing efficiency of voluminous datasets, enabling the analysis of high-resolution aerial imagery with notable accuracy. By utilizing Resilient Distributed Datasets (RDDs), the research offers a detailed algorithmic analysis that reveals the computational advantages of this PySpark-based approach. The proposed Vision Transformer-PySpark framework is evaluated on the DOTA benchmark dataset for aerial images, demonstrating its scalability and superior performance as the amount of computing nodes rise, achieving improved scalability. The comparison of this framework against cutting edge object detection models underscores it’s effectiveness and scalability, setting a new standard for efficient, large-scale aerial image analysis in distributed computing environments.

Authors

Arshi Jamal, K. Ramesh
Karnataka State Akkamahadevi Women’s University, India

Keywords

Object Detection, Aerial Detection, DOTA, Transformers, Distributed Computing

Published By
ICTACT
Published In
ICTACT Journal on Data Science and Machine Learning
( Volume: 6 , Issue: 1 )
Date of Publication
December 2024
Pages
713 - 720

ICT Academy is an initiative of the Government of India in collaboration with the state Governments and Industries. ICT Academy is a not-for-profit society, the first of its kind pioneer venture under the Public-Private-Partnership (PPP) model

Contact Us

ICT Academy
Module No E6 -03, 6th floor Block - E
IIT Madras Research Park
Kanagam Road, Taramani,
Chennai 600 113,
Tamil Nadu, India

For Journal Subscription: journalsales@ictacademy.in

For further Queries and Assistance, write to us at: ictacademy.journal@ictacademy.in