期刊文献+

基于深度学习的数据库重复记录检测算法 被引量:2

Database Duplicate Record Detection Algorithm Based on Deep Learning
下载PDF
导出
摘要 为了提高数据库重复记录检测效果,提出了基于深度学习的数据库重复记录检测算法。首先分析当前数据库重复记录检测的进展,找到引起数据库重复记录检测效果差的原因,然后利用深度学习算法中的支持向量机对数据库重复记录检测进行建模,并引入量子粒子群算法优化支持向量机参数,最后进行了数据库重复记录检测仿真实验,结果表明,文中算法的数据库重复记录检测正确率和效率均很高,数据库重复记录检测结果明显优于当前其它算法。 In order to improve the effect of database duplicate record detection and optimization,a database duplicate record detection algorithm based on deep learning is proposed.Firstly,the paper analyzes the development of duplicate record detection and optimization in database,finds out the reasons for the poor effect of duplicate record detection and optimization in database,then the SVM in deep learning algorithm models the duplicate record detection and optimization in database,and introduces the quantum particle swarm optimization algorithm to optimize the parameters of SVM,and finally carries out the simulation experiment of duplicate record detection in database.The results of testing show that the recall rate and accuracy rate of the algorithm are very high,and the database duplicate record detection results are obviously better than other current algorithms.
作者 陶姿邑 TAO Ziyi(Information Construction Management Office,Shanxi University of Chinese Medicine,Xianyang 712046,China)
出处 《微型电脑应用》 2020年第12期174-176,共3页 Microcomputer Applications
关键词 数据库 重复记录检测 深度学习 量子粒子群算法 databases duplicate record detection deep learning quantum particle swarm algorithm
  • 相关文献

参考文献10

二级参考文献73

  • 1葛利.一种基于混合遗传算法学习的过程神经网络[J].哈尔滨工业大学学报,2005,37(7):986-988. 被引量:21
  • 2陶勇,丁维明.数据库中规范化与反规范化设计的比较与分析[J].计算机技术与发展,2006,16(4):107-109. 被引量:6
  • 3朱恒民,王宁生.一种改进的相似重复记录检测方法[J].控制与决策,2006,21(7):805-808. 被引量:12
  • 4刘智斌,李占利,曹宝香,刘晓峰.虚拟环境中的织物纹理映射算法[J].计算机工程,2006,32(19):205-207. 被引量:3
  • 5Liang Jin, Chen Li, Mehrotra S. Efficient record linkage in large data sets[C]//Proc, of the 8th Int'l Conf on Database. Systems for Advanced Applications. Washington: IEE[Computer Socie- ty, 2003 : 137-148.
  • 6Elmagarmid A K, Panagiotis G, et al. Duplicate record detection: a survey[J]. IEEE Transactions on Knowledge and Data Engi- neering, 2007,19 (1) : 1-16.
  • 7Elmagarmid K, Panagiotis G. Duplicate record detection: a sur- vey [J]. IEEE Transaction on Knowledge and Data Enginee- ring, 2007,19(1) : 1-16.
  • 8Minton S N, Nanjo C, Knobloek C A. A heterogeneous field matching method for record linkage[C]//Proceedings of the 5th International Conference on Data Mining. Washington: IEE[Computer Society, 2005 : 314-321.
  • 9Imagarmid A K, Ipeirotis P G, Verykios V S. Duplicate record detec- tion:a survey [ J ]. IEEE Transactions on Knowledge and Data Engi- neering,2007,19 ( 1 ) : 1 - 16.
  • 10Li Huang, Hai Jin, Pingpeng Yuan, et al. Duplicate records cleansing with length filtering and dynamic weighting [ C ]. Fourth International Conference on Semantics, Knowledge and Grid. Beijing: IEEE Press, 2008:95 - 102.

共引文献74

同被引文献10

引证文献2

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部