DIFFUSION MODELS FOR HIGH-QUALITY IMAGE SYNTHESIS USING BALANCING MODEL COMPLEXITY WITH TRAINING EFFICIENCY

ICTACT Journal on Soft Computing ( Volume: 15 , Issue: 3 )

Abstract

High-quality image synthesis has become a cornerstone of generative modeling, and diffusion models have emerged as a prominent method because they produce detailed, realistic images. High fidelity, however, often demands extensive computational resources and long training runs, which makes it difficult to balance model complexity against training efficiency. Traditional methods struggle to optimize both quality and efficiency, leaving room for innovation in design and implementation. To address this challenge, a novel diffusion-based framework is proposed that incorporates a hybrid noise scheduling mechanism and adaptive model scaling. The method uses an optimized U-Net architecture augmented with attention mechanisms to capture high-resolution features while reducing computational overhead. Furthermore, a diffusion-based training approach gradually increases model complexity, enabling faster convergence and improved efficiency. Experimental results demonstrate the efficacy of the proposed framework. On the CelebA-HQ dataset, it achieves a Fréchet Inception Distance (FID) score of 5.2, outperforming state-of-the-art diffusion models while reducing training time by 15%. On the CIFAR-10 dataset, it achieves an FID score of 2.8, a significant improvement over existing benchmarks. These results highlight the model’s ability to maintain high image quality while substantially reducing computational costs, making it feasible for resource-constrained environments. The proposed approach bridges the gap between computational efficiency and image synthesis quality, paving the way for broader applications in industries such as gaming, design, and content generation, where high-quality visuals are critical.
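
The abstract does not specify how the hybrid noise scheduling mechanism is constructed. As an illustration only, the sketch below shows one plausible reading: a convex blend of the cumulative signal levels of a linear beta schedule and a cosine schedule, two schedules widely used in diffusion models. The function names, the `weight` parameter, and the blending rule itself are assumptions made for this example and are not taken from the paper.

```python
import math

def linear_alpha_bar(T, beta_start=1e-4, beta_end=0.02):
    """Cumulative signal level (product of 1 - beta) for a linear beta schedule. Needs T >= 2."""
    alpha_bar, prod = [], 1.0
    for t in range(T):
        beta = beta_start + (beta_end - beta_start) * t / (T - 1)
        prod *= 1.0 - beta
        alpha_bar.append(prod)
    return alpha_bar

def cosine_alpha_bar(T, s=0.008):
    """Cumulative signal level for a cosine schedule with small offset s."""
    def f(t):
        return math.cos((t / T + s) / (1 + s) * math.pi / 2) ** 2
    return [f(t + 1) / f(0) for t in range(T)]

def hybrid_betas(T, weight=0.5, max_beta=0.999):
    """Hypothetical hybrid schedule (an assumption, not the paper's method).

    Blends the two cumulative signal curves, then converts the blend back
    to per-step noise variances. weight=0 gives the linear schedule,
    weight=1 gives the cosine schedule.
    """
    lin, cos = linear_alpha_bar(T), cosine_alpha_bar(T)
    alpha_bar = [(1 - weight) * a + weight * b for a, b in zip(lin, cos)]
    betas, prev = [], 1.0
    for ab in alpha_bar:
        # beta_t = 1 - alpha_bar_t / alpha_bar_{t-1}, clipped for stability
        betas.append(min(1.0 - ab / prev, max_beta))
        prev = ab
    return betas

betas = hybrid_betas(1000, weight=0.5)
```

Because both component curves decrease monotonically, any blend of them also decreases, so every resulting per-step variance lies strictly between 0 and 1. Blending the cumulative curves rather than the per-step variances is a design choice of this sketch; other combinations, such as switching schedules at a chosen timestep, would be equally consistent with the abstract's wording.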

Authors

S. Vadhana Kumari (Vimal Jyothi Engineering College, India)
Shano Maria Selvan (University of Manchester, United Kingdom)
S. Brilly Sangeetha (IES College of Engineering, India)
Adeline Sneha (Asia Pacific University of Technology and Innovation, Malaysia)

Keywords

Diffusion Models, Image Synthesis, Training Efficiency, Model Complexity, Fréchet Inception Distance

Published By
ICTACT
Date of Publication
January 2025
Pages
3625 - 3633
