Release Intel® Neural Compressor v1.8 Release · intel/neural-compressor

Features

Knowledge distillation
- Implemented the algorithms of paper “Pruning Once For All” accepted by NeurIPS 2021 ENLSP workshop
- Supported optimization pipelines (knowledge distillation & quantization-aware training) on PyTorch
Quantization
- Added the support of ONNX RT 1.7
- Added the support of TensorFlow 2.6.2 and 2.7
- Added the support of PyTorch 1.10
Pruning
- Supported magnitude pruning on TensorFlow
Acceleration library
- Supported Hugging Face top 10 downloaded NLP models

Productivity

Ecosystem

Added notebook of using HuggingFace optimization library (Optimum) to Transformers
Enabled top 20 downloaded Hugging Face NLP models with Optimum
Upstreamed more INC quantized models to ONNX Model Zoo

Validated Configurations

	Channel	Links	Install Command
Source	Github	https://github.com/intel/neural-compressor.git	$ git clone https://github.com/intel/neural-compressor.git
Binary	Pip	https://pypi.org/project/neural-compressor	$ pip install neural-compressor
Binary	Conda	https://anaconda.org/intel/neural-compressor	$ conda install neural-compressor -c conda-forge -c intel

Please feel free to contact [email protected], if you get any questions.

Provide feedback