摘要
基于深度融合生成对抗网络(DF-GAN)多个融合模块相互独立,以致网络融合深度较浅并难以得到最优融合结果的问题,本文提出了一种基于深度传播融合生成对抗网络(DPF-GAN)的文本生成图像算法。该算法通过拼接相邻的仿射模块和融合模块,让前面的融合信息传播至后面的融合模块中,从而促进文本和图像更深层次地融合。实验表明,在CUB-200-2011和COCO数据集上,DPF-GAN生成的图像质量要优于DF-GAN,特别是CUB-200-2011数据集的FID指标减少了11.34%。与递归仿射变换生成对抗网络(RAT-GAN)相比,DPF-GAN的空间复杂度更低且推理速度更快。
The multiple fusion modules of deep fusion generative adversarial network(DF-GAN)were independent of each other,which leaded to a shallow fusion depth and made it difficult to obtain the optimal fusion result.Hence,a text-to-image synthesis algorithm which based on deep propagated fusion generative adversarial network(DPF-GAN)was proposed to solve these issues.This algorithm connected adjacent affine and fusion modules through concatenation,so that the previous fusion information can be propagated to the subsequent fusion modules.This facilitates a deeper integration of text and image.Through experimental results on the CUB-200-2011 dataset and COCO dataset,found that the quality of images which generated by DPF-GAN was better than DF-GAN.The FID score on CUB-200-2011 dataset was decreased by approximately 11.34%compared to DF-GAN.Compared to the Recurrent affine transformation generative adversarial network(RAT-GAN),DPF-GAN offers lower spatial complexity and faster inference speed.
作者
吴海峰
兰强
WU Haifeng;LAN Qiang(School of Computer and Information,Anqing Normal University,Anqing 246133,China)
出处
《安庆师范大学学报(自然科学版)》
2024年第3期78-83,共6页
Journal of Anqing Normal University(Natural Science Edition)
基金
安徽省自然科学基金(2108085MF216)。
关键词
文本生成图像
生成对抗网络
仿射变换
深度传播融合
单级主干
text-to-image synthesis
generative adversarial network
affine transformation
deeply propagated fusion
single level backbone