Dynamic hard pruning of Neural Networks at the edge of the internet

Lorenzo Valerio; Franco Maria Nardini; Andrea Passarella; Raffaele Perego

doi:10.5281/zenodo.6322721

Published March 2, 2022 | Version v1

Journal article, Open access

1. IIT-CNR
2. ISTI-CNR

Neural Networks (NNs), although successfully applied to several Artificial Intelligence tasks, are often unnecessarily over-parametrized. In edge/fog computing, this can make their training prohibitive on resource-constrained devices, which conflicts with the current trend of decentralizing intelligence from remote data centres to local constrained devices. We therefore investigate the problem of training effective NN models on constrained devices with a fixed, potentially small, memory budget. We target techniques that are both resource-efficient and performance-effective while enabling significant network compression. Our Dynamic Hard Pruning (DynHP) technique incrementally prunes the network during training, identifying neurons that contribute only marginally to the model accuracy. DynHP enables a tunable size reduction of the final neural network and reduces the NN memory occupancy during training. The freed memory is reused by a dynamic batch sizing approach that counterbalances the accuracy degradation caused by the hard pruning strategy, improving its convergence and effectiveness. We assess the performance of DynHP through reproducible experiments on three public datasets, comparing it against reference competitors. Results show that DynHP compresses an NN by up to 10 times without significant performance drops (up to 3.5% additional error w.r.t. the competitors), while reducing the training memory occupancy by up to 80%.
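The interplay between hard pruning and dynamic batch sizing described in the abstract can be illustrated with a small sketch. This is not the DynHP implementation: the pruning criterion (L2 norm of each neuron's incoming weights), the memory model (one float per parameter, plus inputs and activations per sample), and all function names are assumptions made only for illustration.

```python
import math


def neuron_norms(weights):
    """L2 norm of each neuron's incoming weights (one row per neuron)."""
    return [math.sqrt(sum(w * w for w in row)) for row in weights]


def hard_prune(weights, keep_fraction):
    """Permanently drop the neurons with the smallest weight norm.

    The surviving rows are returned as a new, smaller layer, so memory is
    actually freed (a mask would keep the zeroed rows allocated).
    The norm-based criterion is an assumption, not the paper's criterion.
    """
    n_keep = max(1, math.ceil(len(weights) * keep_fraction))
    norms = neuron_norms(weights)
    ranked = sorted(range(len(weights)), key=lambda i: norms[i], reverse=True)
    keep = sorted(ranked[:n_keep])  # restore the original neuron order
    return [weights[i] for i in keep]


def count_params(weights):
    """Number of stored weights in the layer."""
    return sum(len(row) for row in weights)


def batch_size_for_budget(budget_floats, n_params, floats_per_sample, max_batch):
    """Largest batch that fits in the budget after storing the parameters."""
    free = budget_floats - n_params
    if free < floats_per_sample:
        raise ValueError("memory budget too small for even one sample")
    return min(max_batch, free // floats_per_sample)


# Toy layer: 8 neurons with 4 inputs each; neuron i has weights 0.1 * (i + 1).
N_INPUTS = 4
BUDGET = 80  # hypothetical memory budget, counted in floats
layer = [[0.1 * (i + 1)] * N_INPUTS for i in range(8)]

# Per-sample cost in this toy model: the input vector plus one activation
# per neuron.
before = batch_size_for_budget(BUDGET, count_params(layer),
                               N_INPUTS + len(layer), max_batch=64)

pruned = hard_prune(layer, keep_fraction=0.5)
after = batch_size_for_budget(BUDGET, count_params(pruned),
                              N_INPUTS + len(pruned), max_batch=64)

print(f"neurons: {len(layer)} -> {len(pruned)}")
print(f"batch size within the same budget: {before} -> {after}")
```

In this toy setting, halving the layer frees enough memory to double the batch size under the same budget, which is the kind of trade the abstract describes. A real training loop would repeat the prune and resize steps incrementally across epochs.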

Notes

This work is partially supported by the following projects: HumanE AI Network (EU H2020 HumanAI-Net, GA #952026), SoBigData++ (EU H2020 SoBigData++, GA #871042), OK-INSAID (MIUR PON ARS01 00917), H2020 MARVEL (GA #957337), and SAI: Social Explainable AI (EC CHIST-ERA-19-XAI-010).

Files

Files (1.2 MB)

Name	Size
JNCA2022_Valerio_etal_preprint.pdf (md5:d84da8e5128ed3abf8cc211ff4f7be7d)	1.2 MB

Additional details

Is published in: Journal article: 10.1016/j.jnca.2021.103330 (DOI)
Is supplemented by: Dataset: http://yann.lecun.com/exdb/mnist/ (URL); Dataset: https://github.com/zalandoresearch/fashion-mnist (URL); Dataset: https://www.cs.toronto.edu/~kriz/cifar.html (URL)

European Commission
HumanE-AI-Net - HumanE AI Network 952026
European Commission
MARVEL - Multimodal Extreme Scale Data Analytics for Smart Cities Environments 957337
European Commission
SoBigData-PlusPlus - SoBigData++: European Integrated Infrastructure for Social Mining and Big Data Analytics 871042

	All versions	This version
Views	327	327
Downloads	246	245
Data volume	301.6 MB	300.4 MB
