深度神经网络后门防御综述

A Survey on Defense against Deep Neural Network Backdoor Attack

下载PDF

导出

摘要深度学习在各领域全面应用的同时,在其训练阶段和推理阶段也面临着诸多安全威胁。神经网络后门攻击是一类典型的面向深度学习的攻击方式,攻击者通过在训练阶段采用数据投毒、模型编辑或迁移学习等手段,向深度神经网络模型中植入非法后门,使得后门触发器在推理阶段出现时,模型输出会按照攻击者的意图偏斜。这类攻击赋予攻击者在一定条件下操控模型输出的能力,具有极强的隐蔽性和破坏性。因此,有效防御神经网络后门攻击是保证智能化服务安全的重要任务之一,也是智能化算法对抗研究的重要问题之一。本文从计算机视觉领域出发,综述了面向深度神经网络后门攻击的防御技术。首先,对神经网络后门攻击和防御的基础概念进行阐述,分析了神经网络后门攻击的三种策略以及建立后门防御机制的阶段和位置。然后,根据防御机制建立的不同阶段或位置,将目前典型的后门防御方法分为数据集级、模型级、输入级和可认证鲁棒性防御四类。每一类方法进行了详细的分析和总结,分析了各类方法的适用场景、建立阶段和研究现状。同时,从防御的原理、手段和场景等角度对每一类涉及到的具体防御方法进行了综合比较。最后,在上述分析的基础上,从针对新型后门攻击的防御方法、其他领域后门防御方法、更通用的后门防御方法、和防御评价基准等角度对后门防御的未来研究方向进行了展望。 While deep learning is widely applied in various applications,it also faces many security threats in its training and inference phases.The neural network backdoor attack is a typical type of deep learning-oriented attack.An attacker can implant an illegal backdoor into deep neural network model during the training phase by employing techniques such as data poisoning,model editing or transfer learning.When the corresponding backdoor trigger appears in the inference phase,the attacked model will give the wrong output according to the attacker's intention.This kind of attack endows the attacker with the ability to control the output of the model through the backdoor trigger,which is highly concealed and destructive.Therefore,effective defense against neural network backdoor attacks is one of the important tasks to ensure the security of intelligent services,and it is also one of the important issues of intelligent algorithm confrontation.In this paper,the defense techniques for deep neural network backdoor attacks are reviewed from the field of computer vision.First,the basic concepts of neural network backdoor attack and defense are explained.The main attack methods are summarized into three categories,and the reasonable positions,advantages and disadvantages of the corresponding defense mechanisms are outlined.Then,according to the different stages of the defense mechanism,the current typical backdoor defense methods are divided into four categories:dataset-level,model-level,input-level,and certifiable robust defense.The methods of each category are analyzed and summarized in detail according to their applicable scenarios,stages and research status.At the same time,a comprehensive comparison of the specific defense methods involved in each category is made from the perspectives of defense principles,means and scenarios.Finally,on the basis of the above analysis,the future research directions of backdoor defense are prospected from the perspectives of defense methods against new backdoor attacks,backdoor defense methods in other fields,more general backdoor defense methods,and defense evaluation benchmarks.

作者江钦辉李默涵孙彦斌 JIANG Qinhui;LI Mohan;SUN Yanbin(Cyberspace Institute of Advanced Technology,Guangzhou University,Guangzhou 510006,China)

机构地区广州大学网络空间安全学院/网络空间先进技术研究院

出处《信息安全学报》 CSCD 2024年第4期47-63,共17页 Journal of Cyber Security

基金国家自然科学基金(No.62372126,No.62072130) 广东省自然科学基金面上项目(No.2021A1515012307,No.2020A1515010450) 广州市科技计划一般项目(No.202102021207,No.202102020867) 广东省高校创新团队项目(No.2020KCXTD007) 广州市高校创新团队项目(No.202032854) 广东省珠江学者岗位计划(2019)资助。

关键词后门防御后门攻击人工智能安全神经网络深度学习 backdoor defense backdoor attack artificial intelligence security neural network deep learning

分类号 TP18 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献3

1杜巍,刘功申.深度学习中的后门攻击综述[J].信息安全学报,2022,7(3):1-16. 被引量：10
2李明慧,江沛佩,王骞,沈超,李琦.针对深度学习模型的对抗性攻击与防御[J].计算机研究与发展,2021,58(5):909-926. 被引量：13
3纪守领,杜天宇,李进锋,沈超,李博.机器学习模型安全与隐私研究综述[J].软件学报,2021,32(1):41-67. 被引量：50

二级参考文献4

1韦璠,宋云飞,邵明莉,刘天,陈小红,王祥丰,陈铭松.利用特征融合和整体多样性提升单模型鲁棒性[J].软件学报,2020,31(9):2756-2769. 被引量：4
2纪守领,李进锋,杜天宇,李博.机器学习模型可解释性方法、应用与安全研究综述[J].计算机研究与发展,2019,56(10):2071-2096. 被引量：150
3陈宇飞,沈超,王骞,李琦,王聪,纪守领,李康,管晓宏.人工智能系统安全与隐私风险[J].计算机研究与发展,2019,56(10):2135-2150. 被引量：51
4陈晋音,沈诗婧,苏蒙蒙,郑海斌,熊晖.车牌识别系统的黑盒对抗攻击[J].自动化学报,2021,47(1):121-135. 被引量：10

共引文献70

1马钰锡,张全新,谭毓安,沈蒙.面向智能攻击的行为预测研究[J].软件学报,2021,32(5):1526-1546. 被引量：5
2杨平林,李泽山,郭改枝.基于改进AdaBoost算法识别包装瓶的设计与实现[J].内蒙古师范大学学报（自然科学版）,2021,50(3):268-274. 被引量：1
3邬友朋,赵金龙,贾中营.一种基于KNN/CNN的供热客服音频分类方法[J].电力大数据,2021,24(7):56-66. 被引量：1
4陈传涛,潘丽敏,罗森林,王子文.基于FGSM样本扩充的模型窃取攻击方法研究[J].信息安全研究,2021,7(11):1023-1030. 被引量：2
5Huanhuan Ni,Yiliang Han,Xiaowei Duan,Guohui Yang.An Improved LeNet-5 Model Based on Encrypted Data[J].国际计算机前沿大会会议论文集,2021(2):166-178.
6纪守领,杜天宇,邓水光,程鹏,时杰,杨珉,李博.深度学习模型鲁棒性研究综述[J].计算机学报,2022,45(1):190-206. 被引量：43
7彭长根.人工智能安全治理挑战与对策[J].信息安全研究,2022,8(4):318-325. 被引量：7
8曹刘娟,匡华峰,刘弘,王言,张宝昌,黄飞跃,吴永坚,纪荣嵘.双标签监督的几何约束对抗训练[J].软件学报,2022,33(4):1218-1230.
9刘佳美,孙涵,林磊.基于伪标签的可防御稳定网络[J].计算机技术与发展,2022,32(6):34-38.
10秦宝东,李媛媛,余沛航.云计算辅助的高效决策树隐私保护查询协议[J].西安邮电大学学报,2022,27(1):1-8.

1吕慧,王传良,尹伟,宋永芬,申滨,曹世强,王建文.浅谈化工项目温室气体排放环境影响评价[J].山东化工,2024,53(11):272-276.
2王盛夏.浅析企业自主安全评价的不足及改善建议[J].机电安全,2024(8):4-7.
3杨丙利.山东省调水工程信息化与智能化管理研究及应用[J].中国地名,2024(5):0082-0084.
4宁晓刚,张翰超,张瑞倩.遥感影像高可信智能不变检测技术框架与方法实践[J].测绘学报,2024,53(6):1098-1112.
5范蓉,马佳焱,王华方,徐悟恒.神经网络模型在开发涂胶工作站中的应用[J].软件工程与应用,2024,13(3):367-375.
6张良伟,张智聪,赵容.基于OBE理念的“应用统计学”课程教学改革探索与实践[J].教育教学论坛,2024(19):113-116.
7程昊,熊大生,唐辉明,张抒.考虑非饱和土基质吸力-回弹指数相关性的弹塑性本构模型及有限元实现[J].长江科学院院报,2024,41(7):132-138.
8马泽军,王洁.识解理论视角下庭审话语中控辩双方叙事对抗研究[J].语文学刊,2024,44(3):50-58.

信息安全学报

2024年第4期

浏览历史

内容加载中请稍等...

深度神经网络后门防御综述

参考文献3

二级参考文献4

共引文献70

相关作者

相关机构

相关主题

浏览历史