================================================================================================
WHAT IS METaQ?
================================================================================================

METaQ (Minimizing Entropy Training and Quantization) is a neural network compression strategy applied during training by adding a regularization term $\phi(w)$ to the loss function, aiming to minimize the entropy of the network's weights. $\phi(w)$ results from a non-differentiable optimization problem that is computationally expensive to solve. However, to train the network it is not necessary to compute $\phi(w)$ explicitly: providing its (sub)gradient to standard machine learning tools (such as PyTorch) is sufficient to guide the training towards a low-entropy weight configuration while maintaining good accuracy. This subgradient is computed from the optimal Lagrange multipliers $\beta^*$ associated with the set of constraints involving the weights $w$ in the problem that defines $\phi(w)$.
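Since $\phi(w)$ enters training only through its (sub)gradient, one natural way to wire it into PyTorch is a custom autograd function whose backward pass returns a subgradient assembled from $\beta^*$. The sketch below illustrates only this pattern: `solve_dual`, the stand-in formulas for $\phi(w)$ and $\beta^*$, and the name `MetaQRegularizer` are hypothetical placeholders, not the actual METaQ machinery.

```python
import torch

def solve_dual(w):
    # Hypothetical placeholder: solve the dual of the problem defining
    # phi(w) and return its value and the optimal multipliers beta*.
    # Both formulas below are fakes so the sketch runs end to end.
    beta_star = torch.tanh(w.detach())   # stand-in for beta*
    phi_value = w.detach().abs().sum()   # stand-in for phi(w)
    return phi_value, beta_star

class MetaQRegularizer(torch.autograd.Function):
    @staticmethod
    def forward(ctx, w):
        phi_value, beta_star = solve_dual(w)
        # The subgradient of phi at w is assembled from beta*;
        # here beta* itself serves as an illustrative stand-in.
        ctx.save_for_backward(beta_star)
        return phi_value

    @staticmethod
    def backward(ctx, grad_output):
        (subgrad,) = ctx.saved_tensors
        return grad_output * subgrad

# Usage inside a standard training step, with w_flat a flattened view of
# the network's weights and lambda_reg a weighting hyperparameter:
# loss = criterion(model(x), y) + lambda_reg * MetaQRegularizer.apply(w_flat)
```

The point of the custom `Function` is that the forward pass never needs a differentiable expression for $\phi(w)$; the backward pass simply hands the solver's subgradient to the optimizer.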
================================================================================================
WHAT IS THIS WORK ABOUT?
================================================================================================

In this work, we develop a procedure to find $\beta^*$ through the application of Lagrangian relaxation and optimization techniques, devising ad hoc methods for specific subproblems where necessary for efficiency. Once the network has been trained to low entropy, the compression strategy culminates in quantizing the weights to the reference values of the buckets towards which training has directed them. The weight encoding task is handled by well-known compression algorithms that achieve entropy coding.

================================================================================================
WHERE DID WE CONDUCT EXPERIMENTS?
================================================================================================

The tests were conducted with the LeNet-5 network on the MNIST dataset, although the strategy is also applicable to larger networks. The achieved results show a 29× compression of LeNet-5 (compression ratio 3.43%) with an accuracy of 99.01%, making METaQ comparable to state-of-the-art NN compression algorithms.

================================================================================================
HOW IS THE DIRECTORY COMPOSED?
================================================================================================

The METaQ main directory is structured as follows:

- BestModelsBeforeQuantization: Stores trained low-entropy models ready for quantization and compression.
- BestModelAfterQuantization: Stores the final quantized models. These are not compressed, so they can be used for inference.
- notebooks: Contains several Jupyter notebooks showcasing outputs of some instances of the main programs (see below):
  - Training.ipynb: trains the network with the METaQ strategy; whenever performance improves on the best reached so far, the model is saved in the BestModelsBeforeQuantization folder.
  - Load&Compression.ipynb: loads a model created during the training phase and compresses it after quantizing it.
  - CuttingPlaneAlgo.ipynb: tests all the algorithms for solving the knapsack-like problem, evaluating their correctness and their performance.
  - ComplexityAnalysis.ipynb: evaluates the complexity of the specialized algorithm for the knapsack-like problem from a probabilistic point of view.
  - SubgradientBundleMethods.ipynb: evaluates and tests the algorithms for solving the dual problem of maximizing $\phi_w(\xi)$.

Besides the notebooks, one can run the code with the following folder organization:

- utils: Contains a set of utility functions used by the various main programs.

Main programs:

- train.py: Trains the network using the METaQ term.
- load_and_quantize.py: Loads the network trained with train.py, quantizes it, and compresses it.

A detailed explanation of the entire implemented compression process can be found in the Report.pdf file.

================================================================================================
HOW CAN I PLAY WITH IT?
================================================================================================

You can use METaQ either with the notebooks or with the .py files, depending on your preferred workflow. First, you need to launch the network training. Be aware that this phase requires a significant amount of time, so let your computer do its job and go for a walk! If you don't have time to train the model from scratch, you can use the pre-trained models available in the BestModelsBeforeQuantization and BestModelAfterQuantization folders. Use the loading and compression program, ensure you are in the correct directory, and load the model you want. If you have just trained the model, retrieve it from BestModelsBeforeQuantization and quantize it using the procedure provided in the program. Otherwise, if it is already quantized, simply compress it and check the results! Have fun!
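To give a concrete picture of the final quantize-and-compress step, here is a minimal sketch: each weight is snapped to the reference value of its nearest bucket, and the bucket indices are then entropy-coded with an off-the-shelf compressor. The uniform bucket grid, the helper names, the model file name, and the use of zlib are illustrative assumptions, not the repository's actual procedure (see load_and_quantize.py and Report.pdf for that).

```python
import zlib
import numpy as np
import torch

def quantize_to_buckets(w: torch.Tensor, centers: torch.Tensor) -> torch.Tensor:
    # Snap every weight to the nearest bucket reference value.
    idx = torch.argmin((w.unsqueeze(-1) - centers).abs(), dim=-1)
    return centers[idx]

# Illustrative uniform grid; METaQ derives its buckets during training.
centers = torch.linspace(-1.0, 1.0, steps=17)

# Hypothetical file name for a trained low-entropy model object.
model = torch.load("BestModelsBeforeQuantization/model.pt")
with torch.no_grad():
    for p in model.parameters():
        p.copy_(quantize_to_buckets(p, centers))

# Entropy-code the bucket indices; low weight entropy means a small code.
flat = torch.cat([p.flatten() for p in model.parameters()])
indices = torch.argmin((flat.unsqueeze(-1) - centers).abs(), dim=-1)
compressed = zlib.compress(indices.numpy().astype(np.uint8).tobytes(), level=9)

# Ratio of compressed size to the original float32 storage (4 bytes/weight).
print(f"compression ratio: {len(compressed) / (flat.numel() * 4):.2%}")
```

Any entropy coder can stand in for zlib here; the closer the trained weight distribution is to a few heavily-used buckets, the smaller the encoded size.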