Systems Engineering and Electronics ›› 2026, Vol. 48 ›› Issue (1): 34-43.doi: 10.12305/j.issn.1001-506X.2026.01.04

• Electronic Technology • Previous Articles     Next Articles

Text-to-image generation based on double-attention generative adversarial network

Zhenxing ZHANG(), Rennong YANG, Yonglin LI, Jialiang ZUO, Liping HU, Shuangyan CHEN   

  1. School of Air Traffic Control and Navigation,Air Force Engineering University,Xi’an 710051,China
  • Received:2024-04-03 Online:2026-01-25 Published:2026-02-11
  • Contact: Yonglin LI E-mail:2207621676@qq.com

Abstract:

To address three prominent issues in multi-stage text-to-image generation methods—prolonged convergence time due to training multiple neural networks, the architecture neglecting the image quality generated by early-stage generators, and the requirement of training multiple discriminators—a text-to-image generation model based on double-attention generative adversarial networks (DoubleGAN) is proposed. DoubleGAN incorporates both channel and pixel attention mechanisms, leveraging sentence vectors to guide the generator in focusing on channels and pixels closely associated with textual content. Meanwhile, a conditionally adaptive instance-wise layer normalization method is introduced, which can adjust the variation amplitudes of shapes and textures according to linguistic information, thereby significantly enhancing the visual-semantic alignment and improving the stability of the training process. Additionally, a novel visual loss is adopted to boost image resolution, ensuring that the generated images possess vivid shapes and perceptually uniform color distributions. Experimental results demonstrate that DoubleGAN achieves excellent performance, substantially increasing the inception score (IS) from 4.75 to 4.97 on the Caltech-UCSD Birds-200-2011(CUB Bird) dataset, indicating its practical application value.

Key words: text-to-image synthesis, generative adversarial network, attention mechanism, single-stage architecture

CLC Number: 

[an error occurred while processing this directive]