Deep learning methods such as neural network models are highly effective for tasks including image classification and natural language processing, but their growing computational cost restricts the deployment of these models in resource-constrained scenarios. Among the methods proposed to address this difficulty, quantization is a promising way to reduce the storage size of network models and accelerate their inference by replacing parameters such as weights with low-bit fixed-point numbers during training. This can be viewed as a discrete constrained optimization problem. In this work, we use the Alternating Direction Method of Multipliers (ADMM) to decouple the continuous parameters from the discrete constraints, so that the original hard optimization problem separates into several more tractable subproblems. In addition, we achieve structure-aligned quantization, which is generally easier for edge computing devices to execute and accelerate. Extensive experiments on the ImageNet and CIFAR-10 datasets show that the resulting low-bit fixed-point models incur only acceptable accuracy loss relative to the original full-precision models. Compared with previous quantization works, the quantized models obtained in this work suffer little classification accuracy drop from the original pre-trained full-precision models, while achieving a hardware-friendly structure that makes the networks easier to deploy.
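As a minimal sketch of the splitting described above, the quantization problem and its ADMM decomposition can be written as follows. The symbols here ($f$ for the training loss, $W$ for the continuous weights, $G$ for the auxiliary quantized copy, $\mathcal{Q}$ for the low-bit fixed-point set, $\lambda$ for the scaled dual variable, and $\rho$ for the penalty) are illustrative notation under the standard scaled-form ADMM, not definitions taken from this work:

```latex
% Original discrete constrained problem: minimize the training loss f
% over weights W restricted to a low-bit fixed-point set Q.
\min_{W} \; f(W) \quad \text{s.t.} \quad W \in \mathcal{Q}

% Introduce an auxiliary variable G with the consensus constraint
% W = G, G \in Q, which decouples the continuous parameters from the
% discrete constraint. The scaled augmented Lagrangian is:
L_{\rho}(W, G, \lambda) = f(W)
  + \frac{\rho}{2}\,\lVert W - G + \lambda \rVert_2^2
  - \frac{\rho}{2}\,\lVert \lambda \rVert_2^2,
  \qquad G \in \mathcal{Q}

% ADMM then alternates three subproblems at each iteration k:
% (1) continuous update of W (e.g., by SGD), discrete set not involved;
W^{k+1} = \arg\min_{W} \; f(W)
  + \frac{\rho}{2}\,\lVert W - G^{k} + \lambda^{k} \rVert_2^2
% (2) discrete update of G, a Euclidean projection onto Q;
G^{k+1} = \arg\min_{G \in \mathcal{Q}} \;
  \lVert W^{k+1} + \lambda^{k} - G \rVert_2^2
% (3) dual update of the scaled multiplier.
\lambda^{k+1} = \lambda^{k} + W^{k+1} - G^{k+1}
```

Under this splitting, the continuous subproblem is ordinary gradient-based training with a quadratic penalty, and the discrete subproblem reduces to an element-wise (or structure-wise) rounding onto $\mathcal{Q}$, which is what makes the decomposition tractable.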