Performance Evaluation of Low-Precision Quantized LeNet and ConvNet Neural Networks

Tatar, Güner; Bayar, Salih; ÇiÇek, İhsan

doi:10.1109/INISTA55318.2022.9894261

Performance Evaluation of Low-Precision Quantized LeNet and ConvNet Neural Networks

dc.contributor.author	Tatar, Güner
dc.contributor.author	Bayar, Salih
dc.contributor.author	ÇiÇek, İhsan
dc.date.accessioned	2025-10-29T12:08:21Z
dc.date.issued	2022
dc.department	Fakülteler, Mühendislik Fakültesi, Elektronik Mühendisliği Bölümü
dc.description	16th International Conference on INnovations in Intelligent SysTems and Applications, INISTA 2022 -- Biarritz -- 182947
dc.description.abstract	Low-precision neural network models are crucial for reducing the memory footprint and computational density. However, existing methods must have an average of 32-bit floating-point (FP32) arithmetic to maintain the accuracy. Floating-point numbers need grave memory requirements in convolutional and deep neural network models. Also, large bit-widths cause too much computational density in hardware architectures. Moreover, existing models must evolve into deeper network models with millions or billions of parameters to solve today's problems. The large number of model parameters increase the computational complexity and cause memory allocation problems, hence existing hardware accelerators become insufficient to address these problems. In applications where accuracy can be traded-off for the sake of hardware complexity, quantization of models enable the use of limited hardware resources to implement neural networks. From hardware design point of view, quantized models are more advantageous in terms of speed, memory and power consumption than using FP32. In this study, we compared the training and testing accuracy of the quantized LeNet and our own ConvNet neural network models at different epochs. We quantized the models using low precision int-4, int-8 and int-16. As a result of the tests, we observed that the LeNet model could only reach 63.59% test accuracy at 400 epochs with int-16. On the other hand, the ConvNet model achieved a test accuracy of 76.78% at only 40 epochs with low precision int-8 quantization. © 2022 Elsevier B.V., All rights reserved.
dc.description.sponsorship	The IEEE Systems, Man, and Cybernetics Society (SMC)
dc.identifier.doi	10.1109/INISTA55318.2022.9894261
dc.identifier.isbn	9781665498104
dc.identifier.scopus	2-s2.0-85139597429
dc.identifier.scopusquality	N/A
dc.identifier.uri	https://doi.org/10.1109/INISTA55318.2022.9894261
dc.identifier.uri	https://hdl.handle.net/20.500.14854/14440
dc.indekslendigikaynak	Scopus
dc.language.iso	en
dc.publisher	Institute of Electrical and Electronics Engineers Inc.
dc.relation.publicationcategory	Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı
dc.rights	info:eu-repo/semantics/closedAccess
dc.snmz	KA_Scopus_20251020
dc.subject	ConvNet
dc.subject	Convolutional neural networks
dc.subject	Fixed point arithmetic
dc.subject	Floating point arithmetic
dc.subject	FPGA
dc.subject	Hardware accelerators
dc.subject	LeNet
dc.subject	Quantized neural networks
dc.title	Performance Evaluation of Low-Precision Quantized LeNet and ConvNet Neural Networks
dc.type	Conference Object

Koleksiyon

Scopus İndeksli Yayınlar Koleksiyonu
Mühendislik Fakültesi Koleksiyonu

Performance Evaluation of Low-Precision Quantized LeNet and ConvNet Neural Networks

Dosyalar

Koleksiyon