Guaranteed Quantization Error Computation for Neural Network Model Compression

Cooke, Wesley; Mo, Zihao; Xiang, Weiming

Computer Science > Machine Learning

arXiv:2304.13812 (cs)

[Submitted on 26 Apr 2023]

Title:Guaranteed Quantization Error Computation for Neural Network Model Compression

Authors:Wesley Cooke, Zihao Mo, Weiming Xiang

View PDF

Abstract:Neural network model compression techniques can address the computation issue of deep neural networks on embedded devices in industrial systems. The guaranteed output error computation problem for neural network compression with quantization is addressed in this paper. A merged neural network is built from a feedforward neural network and its quantized version to produce the exact output difference between two neural networks. Then, optimization-based methods and reachability analysis methods are applied to the merged neural network to compute the guaranteed quantization error. Finally, a numerical example is proposed to validate the applicability and effectiveness of the proposed approach.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:2304.13812 [cs.LG]
	(or arXiv:2304.13812v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2304.13812

Submission history

From: Weiming Xiang [view email]
[v1] Wed, 26 Apr 2023 20:21:54 UTC (121 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2023-04

Change to browse by:

cs
cs.AI
cs.NE

References & Citations

export BibTeX citation

Computer Science > Machine Learning

Title:Guaranteed Quantization Error Computation for Neural Network Model Compression

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Guaranteed Quantization Error Computation for Neural Network Model Compression

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators