Releases · intel/neural-compressor

15 Apr 14:04

ftian1

v1.11

0d766c7

Intel® Neural Compressor v1.11 Release

Features

Quantization
- Supported QDQ as experimental quantization format for ONNX Runtime
- Improved FX symbolic tracing for PyTorch
- Supported multi-metrics for quantization tuning
Knowledge distillation
- Improved distillation algorithm for intermediate layer knowledge transfer
Productivity
- Improved quantization productivity for ONNX Runtime through GUI
- Improved PyTorch INT8 model save/load methods
Ecosystem
- Upstreamed INC quantized Yolov3, DenseNet, Mask-Rcnn, Yolov4 models to ONNX Model Zoo
- Became a PyTorch ecosystem tool shortly after the PyTorch INC tutorial was published
Examples
- Added INC quantized ResNet50 v1.5 and BERT-Large model for IPEX
- Supported dynamic quantization & weight sharing on bare metal reference engine
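
The QDQ format mentioned under Quantization represents quantization as pairs of quantize and dequantize operators (QuantizeLinear and DequantizeLinear in ONNX) placed around tensors. The following is a minimal pure-Python sketch of that round trip, not INC or ONNX Runtime code; the function names, the scale of 0.5, and the sample weights are illustrative assumptions.

```python
def quantize_linear(values, scale, zero_point=0, qmin=-128, qmax=127):
    """Map floats to int8 codes: clamp(round(x / scale) + zero_point)."""
    return [max(qmin, min(qmax, round(v / scale) + zero_point)) for v in values]


def dequantize_linear(codes, scale, zero_point=0):
    """Map int8 codes back to floats: (q - zero_point) * scale."""
    return [(q - zero_point) * scale for q in codes]


# Hypothetical weights; 100.0 is out of range and saturates at qmax.
weights = [0.5, -1.0, 0.26, 100.0]
codes = quantize_linear(weights, scale=0.5)
restored = dequantize_linear(codes, scale=0.5)

print(codes)     # [1, -2, 1, 127]
print(restored)  # [0.5, -1.0, 0.5, 63.5]
```

The gap between `weights` and `restored` is the quantization error that accuracy-driven tuning tries to keep within a tolerance.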

28 Feb 05:27

ftian1

v1.10

1eb6529

Intel® Neural Compressor v1.10 Release

Features

Quantization
- Supported quantization on the latest deep learning frameworks
- Supported quantization for a new model domain (Audio)
- Supported compatible quantization recipes for framework upgrades
Pruning & Knowledge distillation
- Supported fine-tuning and quantization using INC & Optimum for “Prune Once for All: Sparse Pre-Trained Language Models” published at ENLSP NeurIPS Workshop 2021
Structured sparsity
- Validated the sparsity training recipes across multiple model domains (CV, NLP, and Recommendation System)

Productivity

Improved INC GUI for easy quantization
Supported Windows OS conda installation

Ecosystem

Upgraded INC v1.9 into HuggingFace Optimum
Upstreamed INC quantized mobilenet & faster-rcnn models to ONNX Model Zoo

Examples

Supported quantization on 300 random models
Added bare-metal examples for Bert-mini and DLRM

Validated Configurations

Python 3.7, 3.8, 3.9
Centos 8.3 & Ubuntu 18.04 & Win10
TensorFlow 2.6.2, 2.7, 2.8
Intel TensorFlow 1.15.0 UP3, 2.7, 2.8
PyTorch 1.8.0+cpu, 1.9.0+cpu, 1.10.0+cpu
IPEX 1.8.0, 1.9.0, 1.10.0
MxNet 1.6.0, 1.7.0, 1.8.0
ONNX Runtime 1.8.0, 1.9.0, 1.10.0

Distribution:

	Channel	Links	Install Command
Source	Github	https://github.com/intel/neural-compressor.git	$ git clone https://github.com/intel/neural-compressor.git
Binary	Pip	https://pypi.org/project/neural-compressor	$ pip install neural-compressor
Binary	Conda	https://anaconda.org/intel/neural-compressor	$ conda install neural-compressor -c conda-forge -c intel

Contact:

Please feel free to contact [email protected] if you have any questions.

04 Jan 01:56

ftian1

v1.9

768c49e

Intel® Neural Compressor v1.9 Release

Features

Knowledge distillation
- Supported one-shot compression pipelines (knowledge distillation during quantization-aware training) on PyTorch
- Added more distillation examples on TensorFlow and PyTorch
Quantization
- Supported multi-objective tuning for quantization
- Supported Intel Extension for PyTorch v1.10 version
- Improved quantization-aware training support on PyTorch v1.10
Pruning
- Added more magnitude pruning examples on TensorFlow
Reference bare-metal examples
- Supported BF16 optimizations on NLP models
- Added sparse DLRM model (experimental)
Productivity
- Added a Python-friendly API (alternative to the YAML configuration file)
- Made user-facing APIs more Pythonic
Ecosystem
- Integrated pruning API into HuggingFace Optimum
- Added ssd-mobilenetv1, efficientnet, ssd, fcn_rn50, inception_v1 quantized models to ONNX Model Zoo

Validated Configurations

Python 3.7 & 3.8 & 3.9
Centos 8.3 & Ubuntu 18.04
TensorFlow 2.6.2 & 2.7
Intel TensorFlow 2.4.0, 2.5.0 and 1.15.0 UP3
PyTorch 1.8.0+cpu, 1.9.0+cpu, IPEX 1.8.0
MxNet 1.6.0, 1.7.0, 1.8.0
ONNX Runtime 1.6.0, 1.7.0, 1.8.0

Distribution:

	Channel	Links	Install Command
Source	Github	https://github.com/intel/neural-compressor.git	$ git clone https://github.com/intel/neural-compressor.git
Binary	Pip	https://pypi.org/project/neural-compressor	$ pip install neural-compressor
Binary	Conda	https://anaconda.org/intel/neural-compressor	$ conda install neural-compressor -c conda-forge -c intel

Contact:

Please feel free to contact [email protected] if you have any questions.

10 Dec 07:24

ftian1

v1.8.1

561a47e

Intel® Neural Compressor v1.8.1 Release

Features

Knowledge distillation
- Supported knowledge distillation on TensorFlow
Pruning
- Supported multi-node training on TensorFlow
Acceleration library
- Supported Hugging Face minilm_l6_h384_uncased_sst2, bert_base_cased_mrpc, and bert_base_nli_mean_tokens_stsb models
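
For readers new to the knowledge distillation feature listed above, the following is a minimal sketch of a common distillation loss: a weighted sum of the usual cross-entropy on the true label and a temperature-softened KL divergence between teacher and student outputs. It is plain Python, not INC's API; the function names and the `temperature` and `alpha` defaults are illustrative assumptions.

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over temperature-scaled logits."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, label,
                      temperature=2.0, alpha=0.5):
    """alpha * hard-label cross-entropy + (1 - alpha) * T^2 * KL(teacher || student)."""
    hard = -math.log(softmax(student_logits)[label])
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    soft = sum(t * math.log(t / s) for t, s in zip(p_teacher, p_student))
    return alpha * hard + (1 - alpha) * temperature ** 2 * soft


# Hypothetical logits for a two-class example.
agrees = distillation_loss([2.0, 0.0], [2.0, 0.0], label=0)
disagrees = distillation_loss([2.0, 0.0], [0.0, 2.0], label=0)
print(agrees < disagrees)  # True: disagreeing with the teacher costs more
```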

Validated Configurations

Python 3.6 & 3.7 & 3.8 & 3.9
Centos 8.3 & Ubuntu 18.04
TensorFlow 2.6.2 & 2.7
Intel TensorFlow 2.4.0, 2.5.0 and 1.15.0 UP3
PyTorch 1.8.0+cpu, 1.9.0+cpu, IPEX 1.8.0
MxNet 1.6.0, 1.7.0, 1.8.0
ONNX Runtime 1.6.0, 1.7.0, 1.8.0

Distribution:

	Channel	Links	Install Command
Source	Github	https://github.com/intel/neural-compressor.git	$ git clone https://github.com/intel/neural-compressor.git
Binary	Pip	https://pypi.org/project/neural-compressor	$ pip install neural-compressor
Binary	Conda	https://anaconda.org/intel/neural-compressor	$ conda install neural-compressor -c conda-forge -c intel

Contact:

Please feel free to contact [email protected] if you have any questions.

22 Nov 05:22

ftian1

v1.8

d25f0fd

Intel® Neural Compressor v1.8 Release

Features

Knowledge distillation
- Implemented the algorithms of the paper “Prune Once for All” accepted by the NeurIPS 2021 ENLSP workshop
- Supported optimization pipelines (knowledge distillation & quantization-aware training) on PyTorch
Quantization
- Added support for ONNX Runtime 1.7
- Added support for TensorFlow 2.6.2 and 2.7
- Added support for PyTorch 1.10
Pruning
- Supported magnitude pruning on TensorFlow
Acceleration library
- Supported Hugging Face top 10 downloaded NLP models
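
Magnitude pruning, listed under Pruning above, removes the weights with the smallest absolute values. The following is a minimal sketch on a flat list of weights, assuming a single target sparsity applied in one shot; the function name and sample values are hypothetical and this is not INC's implementation.

```python
def magnitude_prune(weights, sparsity):
    """Zero the `sparsity` fraction of weights with the smallest magnitude."""
    n_prune = int(len(weights) * sparsity)
    # Indices ordered by ascending |weight|; the first n_prune are zeroed.
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    pruned_idx = set(order[:n_prune])
    return [0.0 if i in pruned_idx else w for i, w in enumerate(weights)]


# Hypothetical layer weights.
layer = [0.9, -0.05, 0.3, -0.7, 0.01, 0.2]
pruned = magnitude_prune(layer, sparsity=0.5)
print(pruned)  # [0.9, 0.0, 0.3, -0.7, 0.0, 0.0]
```

In practice the sparsity target is usually raised gradually during training rather than applied in a single step.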

Productivity

Added a performance profiling feature to the INC UI service
Improved the user interface so that quantization takes only a few clicks

Ecosystem

Added a notebook on using the HuggingFace optimization library (Optimum) with Transformers
Enabled top 20 downloaded Hugging Face NLP models with Optimum
Upstreamed more INC quantized models to ONNX Model Zoo

Validated Configurations

Python 3.6 & 3.7 & 3.8 & 3.9
Centos 8.3 & Ubuntu 18.04
TensorFlow 2.6.2 & 2.7
Intel TensorFlow 2.4.0, 2.5.0 and 1.15.0 UP3
PyTorch 1.8.0+cpu, 1.9.0+cpu, IPEX 1.8.0
MxNet 1.6.0, 1.7.0, 1.8.0
ONNX Runtime 1.6.0, 1.7.0, 1.8.0

Distribution:

	Channel	Links	Install Command
Source	Github	https://github.com/intel/neural-compressor.git	$ git clone https://github.com/intel/neural-compressor.git
Binary	Pip	https://pypi.org/project/neural-compressor	$ pip install neural-compressor
Binary	Conda	https://anaconda.org/intel/neural-compressor	$ conda install neural-compressor -c conda-forge -c intel

Contact:

Please feel free to contact [email protected] if you have any questions.

24 Oct 23:35

ftian1

v1.7.1

b5a84e7

Intel® Neural Compressor v1.7.1 Release

The Intel® Neural Compressor (formerly known as Intel® Low Precision Optimization Tool) v1.7.1 release features:

Features

Acceleration library
- Supported a unified buffer memory allocation policy

Ecosystem

Upstreamed INC quantized models (alexnet/caffenet/googlenet/squeezenet) to ONNX Model Zoo

Documentation

Performance and accuracy data update

Validated Configurations

Python 3.6 & 3.7 & 3.8 & 3.9
Centos 8.3 & Ubuntu 18.04
TensorFlow 2.6.0
Intel TensorFlow 2.4.0, 2.5.0 and 1.15.0 UP3
PyTorch 1.8.0+cpu, 1.9.0+cpu, IPEX 1.8.0
MxNet 1.6.0, 1.7.0, 1.8.0
ONNX Runtime 1.6.0, 1.7.0, 1.8.0

Distribution:

	Channel	Links	Install Command
Source	Github	https://github.com/intel/neural-compressor.git	$ git clone https://github.com/intel/neural-compressor.git
Binary	Pip	https://pypi.org/project/neural-compressor	$ pip install neural-compressor
Binary	Conda	https://anaconda.org/intel/neural-compressor	$ conda install neural-compressor -c conda-forge -c intel

Contact:

Please feel free to contact INC Maintainers if you have any questions.

01 Oct 06:05

ftian1

v1.7

7607ee4

Intel® Neural Compressor v1.7 Release

The Intel® Neural Compressor (formerly known as Intel® Low Precision Optimization Tool) v1.7 release features:

Features

Quantization
- Improved quantization accuracy for SSD-ResNet34 and MobileNet v3 on TensorFlow
Pruning
- Supported magnitude pruning on TensorFlow
Knowledge distillation
- Supported knowledge distillation on PyTorch
Multi-node support
- Supported multi-node pruning with distributed dataloader on PyTorch
- Supported multi-node inference for benchmark on PyTorch
Acceleration library
- Added a domain-specific acceleration library for NLP models

Productivity

Supported the configuration-free (pure Python) quantization
Improved the user interface so that quantization takes only a few clicks

Ecosystem

Integrated into HuggingFace optimization library (Optimum)
Upstreamed INC quantized models (RN50, VGG16) to ONNX Model Zoo

Documentation

Add tutorial and examples for knowledge distillation
Add tutorial and examples for multi-node training
Add tutorial and examples for acceleration library

Validated Configurations

Python 3.6 & 3.7 & 3.8 & 3.9
Centos 8.3 & Ubuntu 18.04
TensorFlow 2.6.0
Intel TensorFlow 2.4.0, 2.5.0 and 1.15.0 UP3
PyTorch 1.8.0+cpu, 1.9.0+cpu, IPEX 1.8.0
MxNet 1.6.0, 1.7.0, 1.8.0
ONNX Runtime 1.6.0, 1.7.0, 1.8.0

Distribution:

	Channel	Links	Install Command
Source	Github	https://github.com/intel/neural-compressor.git	$ git clone https://github.com/intel/neural-compressor.git
Binary	Pip	https://pypi.org/project/neural-compressor	$ pip install neural-compressor
Binary	Conda	https://anaconda.org/intel/neural-compressor	$ conda install neural-compressor -c conda-forge -c intel

Contact:

Please feel free to contact [email protected] if you have any questions.

20 Aug 17:08

ftian1

v1.6

22ee51b

Intel® Low Precision Optimization Tool v1.6 Release

The Intel® Low Precision Optimization Tool v1.6 release features:

Pruning:

Support pruning and post-training quantization pipeline on PyTorch
Support pruning during quantization-aware training on PyTorch

Quantization:

Support post-training quantization on TensorFlow 2.6.0, PyTorch 1.9.0, IPEX 1.8.0, and MXNet 1.8.0
Support quantization-aware training on TensorFlow 2.x (Keras API)

User Experience:

Improve quantization productivity with new UI
Support quantized model recovery from tuning history

New Models:

Support ResNet50 on ONNX model zoo

Documentation:

Add pruned models
Add quantized MLPerf models

Validated Configurations:

Python 3.6 & 3.7 & 3.8 & 3.9
Centos 8.3 & Ubuntu 18.04
TensorFlow 2.6.0
Intel TensorFlow 2.4.0, 2.5.0 and 1.15.0 UP3
PyTorch 1.8.0+cpu, 1.9.0+cpu, IPEX 1.8.0
MxNet 1.6.0, 1.7.0, 1.8.0
ONNX Runtime 1.6.0, 1.7.0, 1.8.0

Distribution:

	Channel	Links	Install Command
Source	Github	https://github.com/intel/lpot.git	$ git clone https://github.com/intel/lpot.git
Binary	Pip	https://pypi.org/project/lpot	$ pip install lpot
Binary	Conda	https://anaconda.org/intel/lpot	$ conda install lpot -c conda-forge -c intel

Contact:

Please feel free to contact [email protected] if you have any questions.

25 Jul 14:26

ftian1

v1.5.1

1e29588

Intel® Low Precision Optimization Tool v1.5.1 Release

The Intel® Low Precision Optimization Tool v1.5.1 release features:

Gradient-sensitivity pruning for CNN model
Static quantization support for ONNX NLP model
Dynamic seq length support in NLP dataloader
Enrich quantization statistics

Validated Configurations:

Python 3.6 & 3.7 & 3.8 & 3.9
Centos 8.3 & Ubuntu 18.04
Intel TensorFlow 1.15.2, 2.1.0, 2.2.0, 2.3.0, 2.4.0, 2.5.0 and 1.15.0 UP1 & UP2 & UP3
PyTorch 1.5.0+cpu, 1.6.0+cpu, 1.8.0+cpu, ipex
MxNet 1.6.0, 1.7.0
ONNX Runtime 1.6.0, 1.7.0, 1.8.0

Distribution:

	Channel	Links	Install Command
Source	Github	https://github.com/intel/lpot.git	$ git clone https://github.com/intel/lpot.git
Binary	Pip	https://pypi.org/project/lpot	$ pip install lpot
Binary	Conda	https://anaconda.org/intel/lpot	$ conda install lpot -c conda-forge -c intel

Contact:

Please feel free to contact [email protected] if you have any questions.

12 Jul 14:23

ftian1

v1.5

b29bf66

Intel® Low Precision Optimization Tool v1.5 Release

The Intel® Low Precision Optimization Tool v1.5 release features:

Add pattern-lock sparsity algorithm for NLP fine-tuning tasks
- Up to 70% unstructured sparsity and 50% structured sparsity with <2% accuracy loss on 5 Bert finetuning tasks
Add NLP head pruning algorithm for HuggingFace models
- Up to 3.0x performance speedup within 1.5% accuracy loss on HuggingFace BERT SST-2
Support model optimization pipeline
Integrate SigOPT with multi-metrics optimization
- Complements the basic strategy to speed up tuning
Support TensorFlow 2.5, PyTorch 1.8, and ONNX Runtime 1.8
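
The pattern-lock idea named in the list above can be read as follows: the positions of the zeros in an already sparse model are frozen, and fine-tuning updates only the remaining weights. The following is a minimal sketch under that assumption, using plain gradient descent on a flat weight list; the function names, learning rate, and gradients are hypothetical, and this is not the tool's own implementation.

```python
def sparsity_mask(weights):
    """True where a weight is non-zero, i.e. allowed to keep training."""
    return [w != 0.0 for w in weights]


def pattern_locked_step(weights, grads, mask, lr=0.1):
    """One SGD step that leaves masked-out (zero) positions at exactly zero."""
    return [w - lr * g if keep else 0.0
            for w, g, keep in zip(weights, grads, mask)]


def sparsity_ratio(weights):
    """Fraction of weights that are exactly zero."""
    return sum(1 for w in weights if w == 0.0) / len(weights)


# Hypothetical sparse weights and gradients.
sparse_weights = [0.5, 0.0, -0.25, 0.0]
mask = sparsity_mask(sparse_weights)
updated = pattern_locked_step(sparse_weights, [1.0, 1.0, -1.0, 1.0], mask)

# The zero pattern, and therefore the sparsity ratio, survives the update.
print(sparsity_ratio(sparse_weights), sparsity_ratio(updated))  # 0.5 0.5
```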

Validated Configurations:

Python 3.6 & 3.7 & 3.8 & 3.9
Centos 8.3 & Ubuntu 18.04
Intel TensorFlow 1.15.2, 2.1.0, 2.2.0, 2.3.0, 2.4.0, 2.5.0 and 1.15.0 UP1 & UP2 & UP3
PyTorch 1.5.0+cpu, 1.6.0+cpu, 1.8.0+cpu, ipex
MxNet 1.6.0, 1.7.0
ONNX Runtime 1.6.0, 1.7.0, 1.8.0

Distribution:

	Channel	Links	Install Command
Source	Github	https://github.com/intel/lpot.git	$ git clone https://github.com/intel/lpot.git
Binary	Pip	https://pypi.org/project/lpot	$ pip install lpot
Binary	Conda	https://anaconda.org/intel/lpot	$ conda install lpot -c conda-forge -c intel

Contact:

Please feel free to contact [email protected] if you have any questions.
