Try pruning here

Quantization

Quantization reduces the numerical precision used to represent a model's weights, for example by storing 32-bit floating-point values as 8-bit integers. This can significantly decrease the model size and increase inference speed, often without a substantial loss in accuracy.
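
As a rough illustration of the idea, the sketch below applies symmetric 8-bit quantization to a list of weights using only the Python standard library. The function names `quantize_int8` and `dequantize` are hypothetical and chosen for this example, and the single per-tensor scale is an assumption; this is not the API of any particular framework, and real toolkits typically offer more refined schemes.

```python
import random
from array import array


def quantize_int8(weights):
    """Map float weights to integers in [-127, 127] using one shared scale."""
    max_abs = max(abs(w) for w in weights)
    # Assumption: symmetric, per-tensor scaling. Guard against all-zero input.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    q = [max(-127, min(127, round(w / scale))) for w in weights]
    return q, scale


def dequantize(q, scale):
    """Recover approximate float weights from the quantized integers."""
    return [v * scale for v in q]


random.seed(0)
weights = [random.gauss(0.0, 0.1) for _ in range(1000)]

q, scale = quantize_int8(weights)
restored = dequantize(q, scale)

# Storage per weight: 32-bit float versus 8-bit signed integer.
float_bytes = array("f", weights).itemsize * len(weights)
int8_bytes = array("b", q).itemsize * len(q)
max_error = max(abs(w - r) for w, r in zip(weights, restored))

print("float32 bytes:", float_bytes)
print("int8 bytes:", int8_bytes)
print("largest rounding error:", max_error)
```

Each restored weight differs from the original by at most half of `scale`, which is the rounding error this scheme introduces in exchange for the smaller representation.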

Coming Soon!