Activation Redistribution Based Hybrid Asymmetric Quantization Method of Neural Networks

下载PDF

导出

摘要 The demand for adopting neural networks in resource-constrained embedded devices is continuously increasing.Quantization is one of the most promising solutions to reduce computational cost and memory storage on embedded devices.In order to reduce the complexity and overhead of deploying neural networks on Integeronly hardware,most current quantization methods use a symmetric quantization mapping strategy to quantize a floating-point neural network into an integer network.However,although symmetric quantization has the advantage of easier implementation,it is sub-optimal for cases where the range could be skewed and not symmetric.This often comes at the cost of lower accuracy.This paper proposed an activation redistribution-based hybrid asymmetric quantizationmethod for neural networks.The proposedmethod takes data distribution into consideration and can resolve the contradiction between the quantization accuracy and the ease of implementation,balance the trade-off between clipping range and quantization resolution,and thus improve the accuracy of the quantized neural network.The experimental results indicate that the accuracy of the proposed method is 2.02%and 5.52%higher than the traditional symmetric quantization method for classification and detection tasks,respectively.The proposed method paves the way for computationally intensive neural network models to be deployed on devices with limited computing resources.Codes will be available on https://github.com/ycjcy/Hybrid-Asymmetric-Quantization.

作者 Lu Wei Zhong Ma Chaojie Yang

机构地区 R&D Innovation Center

出处《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第1期981-1000,共20页 工程与科学中的计算机建模（英文）

基金 The Qian Xuesen Youth Innovation Foundation from China Aerospace Science and Technology Corporation(Grant Number 2022JY51).

关键词 QUANTIZATION neural network hybrid asymmetric ACCURACY

分类号 TP183 [自动化与计算机技术—控制理论与控制工程]

Computer Modeling in Engineering & Sciences

2024年第1期

浏览历史

内容加载中请稍等...

Activation Redistribution Based Hybrid Asymmetric Quantization Method of Neural Networks

相关作者

相关机构

相关主题

浏览历史