Paper
23 May 2023 ADMM-based structure-aligned quantized neural network
Chengxuan Wang
Author Affiliations +
Proceedings Volume 12645, International Conference on Computer, Artificial Intelligence, and Control Engineering (CAICE 2023); 1264528 (2023) https://doi.org/10.1117/12.2681216
Event: International Conference on Computer, Artificial Intelligence, and Control Engineering (CAICE 2023), 2023, Hangzhou, China
Abstract
Nowadays, deep learning methods such as neural network models are highly effective for various tasks including image classification and natural language processing, but the increasingly high computational cost restricts the deployment of these network models in many kinds of scenarios whose resources are usually limited. Among all kinds of methods to solve these difficulties, quantization is a plausible way to reduce the storage size of these network models and accelerate their inference process by replacing the parameters such as weights with low-bit fixed numbers during the training process. This problem can be viewed as a discrete constrained optimization problem. In this work, we use Alternative Direction Methods of Multipliers (ADMM) to decouple the continuous parameters from the discrete constraints so that the original hard optimization problem is separated into several subproblems. In addition, structure-aligned quantization is also achieved, which is usually more friendly for edge computing devices to execute and accelerate. With extensive experiments on ImageNet and CIFAR10 dataset, models represented by low-bit fixed-point numbers with acceptable accuracy loss compared with original full precision models can be acquired. Compared with some previous quantization works, the quantization models obtained in this work have little classification accuracy drop compared with the original pre-trained full precision model, and a kind of hardware-friendly structure that makes the neural network easier to deploy is achieved.
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Chengxuan Wang "ADMM-based structure-aligned quantized neural network", Proc. SPIE 12645, International Conference on Computer, Artificial Intelligence, and Control Engineering (CAICE 2023), 1264528 (23 May 2023); https://doi.org/10.1117/12.2681216
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Quantization

Education and training

Neural networks

Convolution

Mathematical optimization

Back to Top