摘要
针对传统卷积神经网络感受野的大小受限和特征交互学习能力弱,基于卷积神经网络的伪造人脸检测技术提取到的特征相对单一的问题,提出了基于增强Swin Transformer的深度伪造人脸检测方法,引入了局部多头自注意力和全局多头自注意力机制,结合了Swin Transformer的优势,能够有效地捕获图像上下文信息和视频时序关系,具有较强的全局感受野和长距离依赖建模能力。在DFDC数据集的实验结果表明,该方法优于基线方法,具有较好的深度伪造人脸检测能力。
Addressing the issues of limited receptive field size and weak feature interaction learning capabilities in traditional convolutional neural networks,resulting in relatively singular feature extraction in conventional convolutional neural network‑based deepfake face detection techniques,a deepfake face detection method based on enhanced Swin Transformer is proposed in this pa‑per.This method introduces local multi‑head self‑attention and global multi‑head self‑attention mechanisms,leveraging the strengths of Swin Transformer to effectively capture image context information and video temporal relationships,with strong global receptive fields and long‑distance dependency modeling capabilities.Experimental results on the DFDC dataset demonstrate that our approach outperforms baseline methods,exhibiting superior deepfake face detection capabilities.
作者
李杏清
王志兵
杨恺
Li Xingqing;Wang Zhibing;Yang Kai(College of Information Engineering,Guangdong Innovative Technical College,Dongguan 523960,China;School of Electronic Information,Dongguan Polytechnic,Dongguan 523808,China;School of Architecture,Dongguan Polytechnic,Dongguan 523808,China)
出处
《现代计算机》
2024年第14期26-30,58,共6页
Modern Computer
基金
广东省教育厅2022年度普通高校科研平台特色创新类项目(2022KTSCX385)
广东省教育厅2023年度普通高校科研平台特色创新类项目(2023KTSCX356)
2022年东莞市社会发展科技面上项目(20221800903482)
东莞职业技术学院科研基金资助项目(2022a01)。