Open Access System for Information Sharing

Login Library

 

Conference
Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

NIPQ: Noise proxy-based Integrated Pseudo-Quantization

Title
NIPQ: Noise proxy-based Integrated Pseudo-Quantization
Authors
Shin, JuncheolSo, JunhyukPark, SeinKang, SeungyeopYoo, SungjooPARK, EUNHYEOK
Date Issued
2023-06-20
Publisher
IEEE Computer Society
Abstract
Straight-through estimator (STE), which enables the gradient flow over the non-differentiable function via approximation, has been favored in studies related to quantization-aware training (QAT). However, STE incurs unstable convergence during QAT, resulting in notable quality degradation in low precision. Recently, pseudo-quantization training has been proposed as an alternative approach to updating the learnable parameters using the pseudo-quantization noise instead of STE. In this study, we propose a novel noise proxy-based integrated pseudo-quantization (NIPQ) that enables unified support of pseudo-quantization for both activation and weight by integrating the idea of truncation on the pseudo-quantization framework. NIPQ updates all of the quantization parameters (e.g., bit-width and truncation boundary) as well as the network parameters via gradient descent without STE instability. According to our extensive experiments, NIPQ outperforms existing quantization algorithms in various vision and language applications by a large margin.
URI
https://oasis.postech.ac.kr/handle/2014.oak/120968
Article Type
Conference
Citation
2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023, page. 3852 - 3861, 2023-06-20
Files in This Item:
There are no files associated with this item.

qr_code

  • mendeley

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Views & Downloads

Browse