Neural Network Compression

Abstract:

The paper presents the problem of neural network compression, focusing mainly on deep neural networks (DNNs). The negative effects of over-parameterisation of models and basic compression techniques used in practice are discussed. For each method, several current implementations described in the literature are presented, describing their operating principles and achieved effects at given model compression ratios.