HIGH-EFFICIENCY AERIAL DETECTION WITH VISION TRANSFORMERS AND PYSPARK IN A DISTRIBUTED FRAMEWORK
Abstract
Image processing tasks in Computer vision such as segmentation, object detection, and classification are foundational in geosciences, driving advancements by enabling precise analysis and interpretation of Earth’s dynamic systems and landscapes. The processing of high-resolution aerial images for object detection presents significant challenges, including the need for high detection accuracy and the ability to handle vast datasets effectively. Traditional methods often struggle with the scale and complexity of such tasks, necessitating innovations that can leverage distributed computing to meet these demands. This study introduces a groundbreaking framework that integrates Vision Transformers, a cutting-edge architecture for object detection, with PySpark’s distributed computing capabilities. This inference model significantly enhances batch inference processing efficiency of voluminous datasets, enabling the analysis of high-resolution aerial imagery with notable accuracy. By utilizing Resilient Distributed Datasets (RDDs), the research offers a detailed algorithmic analysis that reveals the computational advantages of this PySpark-based approach. The proposed Vision Transformer-PySpark framework is evaluated on the DOTA benchmark dataset for aerial images, demonstrating its scalability and superior performance as the amount of computing nodes rise, achieving improved scalability. The comparison of this framework against cutting edge object detection models underscores it’s effectiveness and scalability, setting a new standard for efficient, large-scale aerial image analysis in distributed computing environments.

Authors
Arshi Jamal, K. Ramesh
Karnataka State Akkamahadevi Women’s University, India

Keywords
Object Detection, Aerial Detection, DOTA, Transformers, Distributed Computing
Yearly Full Views
JanuaryFebruaryMarchAprilMayJuneJulyAugustSeptemberOctoberNovemberDecember
700000000000
Published By :
ICTACT
Published In :
ICTACT Journal on Data Science and Machine Learning
( Volume: 6 , Issue: 1 , Pages: 713 - 720 )
Date of Publication :
December 2024
Page Views :
58
Full Text Views :
7

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.