应用多索引加法量化编码的近邻检索算法被引量：3

Applying multi-index additive quantization encoding method for approximate nearest neighbor search

导出

摘要目的海量图像检索技术是计算机视觉领域研究热点之一,一个基本的思路是对数据库中所有图像提取特征,然后定义特征相似性度量,进行近邻检索。海量图像检索技术,关键的是设计满足存储需求和效率的近邻检索算法。为了提高图像视觉特征的近似表示精度和降低图像视觉特征的存储空间需求,提出了一种多索引加法量化方法。方法由于线性搜索算法复杂度高,而且为了满足检索的实时性,需把图像描述符存储在内存中,不能满足大规模检索系统的需求。基于非线性检索的优越性,本文对非穷尽搜索的多索引结构和量化编码进行了探索新研究。利用多索引结构将原始数据空间划分成多个子空间,把每个子空间数据项分配到不同的倒排列表中,然后使用压缩编码的加法量化方法编码倒排列表中的残差数据项,进一步减少对原始空间的量化损失。在近邻检索时采用非穷尽搜索的策略,只在少数倒排列表中检索近邻项,可以大大减少检索时间成本,而且检索过程中不用存储原始数据,只需存储数据集中每个数据项在加法量化码书中的码字索引,大大减少内存消耗。结果为了验证算法的有效性,在3个数据集SIFT、GIST、MNIST上进行测试,召回率相比近几年算法提升4%~15%,平均查准率提高12%左右,检索时间与最快的算法持平。结论本文提出的多索引加法量化编码算法,有效改善了图像视觉特征的近似表示精度和存储空间需求,并提升了在大规模数据集的检索准确率和召回率。本文算法主要针对特征进行近邻检索,适用于海量图像以及其他多媒体数据的近邻检索。 Objective As the amount of image data produced every day increases,large-scale image retrieval technology is one of the hot topics in the field of computer vision. The basic idea is to extract features from all the images in the database and define the similarity measure to perform nearest neighbor search. The key of massive image retrieval is to design a nearest neighbor search algorithm that can meet efficiency and storage needs. An approximate nearest neighbor search based on multi-index additive quantization method is presented to improve the approximate representation accuracy and reduce the storage space requirements of image visual features. Method If each image is described by a set of local descriptors, then an exhaustive search is prohibitive as we need to index billions of descriptors and perform multiple queries. The image descriptors should be stored in memory to ensure real-time retrieval; however,this approach creates a storage problem. Artificial neural network（ANN） algorithms,which mainly include index structure and quantization methods,are typically compared based on the trade-off between search quality and efficiency. On the basis of the superiority of nonlinear search,we employ an inverted multi-index structure to avoid an exhaustive search. The multi-index structure divides the original data space into multiple subspaces. Each subspace that uses an inverted index stores the list of vectors that lie in the proximity of each codeword as each entry of the multi-index table corresponds to a part of the original vector space and contains a list of points that fall within that part. The purpose of a multi-index structure is to generate a list of data vectors that lie close to any query vector efficiently by only searching in a small dataset in which the near neighbors of the query vector most likely lie,thus ensuring a substantial speed-up over the exhaustive search. As a solution to the storage problem,compact code representations of descriptors are used. Vector quantization is an effective and efficient ANN search method. These methods quantize data by using codewords to reduce the cardinality of the data space. Among the vector quantization methods,additive quantization that approximates the vectors using sums of M codewords from M different codebooks generalizes product quantization and further improves product quantization accuracy while retaining its computational efficiency to a large degree. In this study,we use the additive quantization encoding method to encode the residual data produced by the multi-index structure,which further reduces the quantitative loss of original space. We regard the method mentioned previously as a two-stage quantizer that approximates the residual vector of the preceding stage using one of the centroids in the stage codebook and generates a new residual vector for the succeeding quantization stage. The multi-index structure is used as the first-order quantizer to approximate the vectors,and the additive quantization method is utilized as the second-order quantizer to approximate the residual. The non-exhaustive search strategy retrieves only the near neighbors in a few inverted lists,which can significantly reduce the retrieval time cost. With the use of the additive quantization method in the retrieval process,the original data need not be stored in memory and only the index of the codeword in the codebook should be stored,the sum of which is nearest the data item,significantly reducing memory consumption. Result Experiments on three datasets,i. e.,SIFT1M,GIST1M,and MNIST,were conducted to verify the effectiveness of the proposed algorithm. The recall rate of the proposed algorithm is approximately 4% to 15% higher and its average precision is approximately 12% higher than that of existing algorithms. The search time of the proposed algorithm is the same as that of the fastest algorithm. Conclusion An approximate nearest neighbor search based on the multi-index additive quantization method proposed in this study can effectively improve the approximate representation accuracy and reduce the storage space requirements of image visual features. The proposed method also improves retrieval accuracy and recall in large-scale datasets. The proposed algorithm focuses on the nearest neighbor search,which is suitable for large-scale images and other multimedia data.

作者刘恒姚宇曾玲陶攀 Liu Heng;Yao Yu;Zeng Ling;Tao Pan(University of Chinese Academy of Sciences, Beijing 100049, China;Chengdu Institute of Computer Application, Chinese Academy of Sciences, Chengdu 610041, China)

机构地区中国科学院大学中国科学院成都计算机应用研究所

出处《中国图象图形学报》 CSCD 北大核心 2018年第5期652-661,共10页 Journal of Image and Graphics

基金四川省科技厅重点研发基金项目(2017SZ0010 2016JZ0035) 中科院西部之光人才培养计划基金项目

关键词倒排索引压缩编码加法量化近似最近邻检索矢量量化 multi-index compact encoding additive quantization approximate nearest neighbor search vector quantiza-tion

分类号 TP391.41 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

同被引文献40

1朱爽.用直方图面积法进行图像相似度计算[J].测绘通报,2018(12):96-100. 被引量：7
2付晨,钟诚,叶波.MapReduce并行加速数据流多模式相似性搜索[J].计算机应用,2017,37(1):37-41. 被引量：5
3郑万立,杨恒,赵晓坤,陈栩秋.基于条形码及移动APP的光缆信息查询系统技术研究[J].中国新通信,2017,19(1):115-116. 被引量：1
4赵彬,杨祖彬,崔爽.信息型智能包装标签技术的研究进展[J].包装工程,2017,38(3):67-72. 被引量：8
5顾亦然,朱梓嫣.基于LeaderRank和节点相似度的复杂网络重要节点排序算法[J].电子科技大学学报,2017,46(2):441-448. 被引量：35
6王龙江,陈越,严新成,黄恺翔.网络编码云存储系统差分数据更新方案[J].通信学报,2017,38(3):154-164. 被引量：11
7刘萍,李斐雯,杨宇.国外交互式信息检索研究进展[J].情报理论与实践,2017,40(5):132-138. 被引量：13
8周永红,吴芳.大数据时代搜索引擎用户的信息安全问题研究[J].图书馆,2017(5):32-35. 被引量：10
9吴婷.物流配送信息智能传输系统设计[J].现代电子技术,2017,40(13):83-86. 被引量：11
10李璇,邵高鋆,李丽.基于树莓派和条码识别的智能购物车设计[J].电子技术与软件工程,2017(14):135-136. 被引量：6

引证文献3

1景月娟,张晓丽.基于条码识别的标签信息智能检索方法[J].西安工程大学学报,2019,33(4):457-461. 被引量：5
2王旭.基于RGB颜色特征的海报图像自动检索方法[J].自动化技术与应用,2021,40(9):182-186. 被引量：4
3杨凤丽,李娜,刘仁芬.基于多级索引的高维数据近似最近邻搜索[J].计算机仿真,2022,39(11):398-401. 被引量：4

二级引证文献13

1李志杰.无线网络中多源交互信息关键特征检索方法研究[J].电子设计工程,2020,28(1):103-107. 被引量：1
2毕杨,王轩.ORB算法在智能工具箱中的应用研究[J].电子设计工程,2020,28(8):25-29. 被引量：3
3仝梦园,金守峰,陈阳,李毅,尹加杰.改进卷积神经网络的手写试卷分数识别方法[J].西安工程大学学报,2020,34(4):80-85. 被引量：11
4王壮,王洁.SSH框架下基于遗传算法的冷链物流追踪与溯源系统[J].西安工程大学学报,2021,35(2):85-90. 被引量：9
5牛怡婷,熊先青,袁莹莹,张靓婷.板式家具自动化原料与成品仓管控流程对比研究[J].林产工业,2021,58(5):30-33. 被引量：8
6卓雯雯,王小伟,周莉.基于登山环境因素的登山服色彩研究[J].西部皮革,2022,44(5):117-119. 被引量：1
7罗朝熙,和丽芳,杨昌洲,杨社平,黄斌,马关宇,黄宋魏.基于K-means的金属矿物光片嵌布粒度测量[J].有色金属工程,2022,12(7):125-131. 被引量：2
8田珂.二元信息挖掘多模型融合异常弹着靶速度预测[J].弹道学报,2023,35(2):102-110.
9张婷.基于大数据挖掘技术的图书馆服务自动化感知模型[J].自动化与仪器仪表,2023(7):5-9.
10何远景,李光龙.基于多级索引表的金融业务数据库精准查询方法[J].安阳工学院学报,2024,23(2):60-64.

1《太阳能学报》投稿须知[J].太阳能学报,2007,28(10).
2《太阳能学报》投稿须知[J].太阳能学报,2008,29(11).
3李显尧.专利文献及其利用[J].东海海洋,1989,7(2):76-78.
4孙彦楠,夏秀渝.基于深度神经网络的关键词识别系统[J].计算机系统应用,2018,27(5):41-48. 被引量：7
5史忠植.90年代的数据库技术及其市场[J].中国计算机用户,1994(6):73-75.
6林静.拓展信息时代数据存储的极限[J].实用无线电,2001(5):17-19.
7撰稿须知[J].经济纵横,2008(10).
8赵双燕.3D模型检索系统技术及发展趋向探寻[J].电子技术与软件工程,2018(8):66-66.
9李文明,鹿比.作家陈小默[J].文学少年（小学）,2018,0(5):6-8.
10王艳,周小平,王睿,孙冰雪.长白山野生中草药植物图像检索方法研究[J].中国中医药信息杂志,2018,25(2):95-98. 被引量：3

中国图象图形学报

2018年第5期

浏览历史

内容加载中请稍等...

应用多索引加法量化编码的近邻检索算法被引量：3

同被引文献40

引证文献3

二级引证文献13

相关作者

相关机构

相关主题

浏览历史

应用多索引加法量化编码的近邻检索算法 被引量：3

同被引文献40

引证文献3

二级引证文献13

相关作者

相关机构

相关主题

浏览历史

应用多索引加法量化编码的近邻检索算法被引量：3